Bone anti-resorptive compounds

ABSTRACT

The present invention relates to polypeptides that bind to RANK and comprise amino acid sequences of RANKL external surface loops. The invention also relates to fragments, analogs, and derivatives of such polypeptides.  
     The present invention further relates to compositions comprising the polypeptides described herein. Also included are methods for inhibiting osteoclast differentiation, methods for inhibiting bone resorption, and methods for competitively inhibiting RANKL activity. The invention also provides methods for treating diseases or conditions which are at least partially characterized by loss of bone mass.

[0001] This application is related to and claims the benefit of the following U.S. applications, which are incorporated herein by reference: Ser. No. 60/277,855 filed Mar. 22, 2001; Ser. No. 10/105,057 filed Mar. 22, 2002; Ser. No. 60/311,163 filed Aug. 9, 2001; Ser. No. 10/215,446 filed Aug. 9, 2002; Ser. No. 60/329,231 filed Oct. 12, 2001; Ser. No. 60/329,393 filed Oct. 15, 2001; Ser. No. 60/329,360 filed Oct. 15, 2001; Ser. No. 60/328,876 filed Oct. 12, 2001; U.S. non-provisional entitled RANKL Mimics and Uses Thereof, Lam, et al. filed Oct. 15, 2002; and U.S. non-provisional entitled Methods for Screening Osteogenic Compounds, Lam, et al. filed Oct. 15, 2002.

[0002] This invention was made in part with Government support under National Institutes of Health Grants AR32788, AR46123 and DE05413. The Government has certain rights in the invention.

FIELD OF THE INVENTION

[0003] The present invention relates to polypeptides that bind to RANK and whose amino acid sequences comprise one or more external surface loops AA″, CD, EF, and DE of RANKL. Further provided are fragments, analogs, and derivatives of said polypeptides.

[0004] The invention also relates to pharmaceutical compositions comprising the polypeptides provided herein, and/or fragments, analogs, and derivatives thereof. The invention further provides methods for inhibiting osteoclast differentiation and methods for competitively inhibiting RANKL comprising administering an effective amount of said compositions. Also provided are methods for inhibiting bone resorption and methods for treating diseases or conditions which are at least partially characterized by loss of bone mass.

BACKGROUND

[0005] Various conditions and diseases which manifest themselves in bone loss or thinning are a critical and growing health concern. It has been estimated that as many as 30 million Americans and 100 million worldwide are at risk for osteoporosis alone. Mundy et al., Science, 286: 1946-1949 (1999). Other conditions known to involve bone loss include juvenile osteoporosis, osteogenesis imperfecta, hypercalcemia, hyperparathyroidism, osteomalacia, osteohalisteresis, osteolytic bone disease, osteonecrosis, Paget's disease of bone, bone loss due to rheumatoid arthritis, inflammatory arthritis, osteomyelitis, corticosteroid treatment, metastatic bone diseases, periodontal bone loss, bone loss due to cancer, age-related loss of bone mass, and other forms of osteopenia. Additionally, new bone formation is needed in many situations, e.g., to facilitate bone repair or replacement for bone fractures, bone defects, plastic surgery, dental and other implantations and in other such contexts.

[0006] Bone is a dense, specialized form of connective tissue. Bone matrix is formed by osteoblast cells located at or near the surface of existing bone matrix. Bone is resorbed (eroded) by another cell type known as the osteoclast (a type of macrophage). These cells secrete acids, which dissolve bone minerals, and hydrolases, which digest its organic components. Thus, bone formation and remodeling is a dynamic process involving an ongoing interplay between the creation and erosion activities of osteoblasts and osteoclasts. Alberts, et al., Molecular Biology of the Cell, Garland Publishing, New York (3rd ed. 1994), pp. 1182-1186.

[0007] Present forms of clinically-approved bone loss therapy are primarily anti-resorptive, in that they inhibit bone resorption processes. Among the agents which have been used or suggested for treatment of osteoporosis because of their claimed ability to inhibit bone resorption are estrogen, selective estrogen receptor modulators (SERMs), calcium, calcitriol, calcitonin (Sambrook, P. et al., N.Engl.J.Med. 328:1747-1753), alendronate (Saag, K. et al., N.Engl.J.Med. 339:292-299) and other bisphosphonates. Luckman et al., J. Bone Min. Res. 13, 581 (1998). However, currently-available anti-resorptives may have undesired effects relating to their impact on the inhibition of bone resorption/remodeling or other unwanted side effects.

[0008] As a result, it would be very desirable to obtain other compounds for treatments of bone loss diseases. A key development in the field of bone cell biology is the recent discovery that RANK ligand (RANKL, also known as osteoprotegerin ligand (OPGL), TNF-related activation induced cytokine (TRANCE), and osteoclast differentiation factor (ODF)), expressed on stromal cells, osteoblasts, activated T-lymphocytes and mammary epithelium, is the unique molecule essential for differentiation of macrophages into osteoclasts. Lacey, et al., Cell 93: 165-176 (1998) (Osteoprotegerin Ligand Is a Cytokine that Regulates Osteoclast Differentiation and Activation.) The cell surface receptor for RANKL is RANK, Receptor Activator of Necrosis Factor (NF)-kappa B. RANKL is a type-2 transmembrane protein with an intracellular domain of less than about 50 amino acids, a transmembrane domain of about 21 amino acids, and an extracellular domain of about 240 to 250 amino acids. RANKL exists naturally in transmembrane and soluble forms. The deduced amino acid sequence for at least the murine, rat and human forms of RANKL and variants thereof are known. See e.g., Anderson, et al., U.S. Pat. No. 6,017,729, Boyle, U.S. Pat. No. 5,843,678, and Xu J. et al., J. Bone Min. Res. (2000/15:2178) which are incorporated herein by reference. Furthermore, we have solved the crystal structure of RANKL ectodomain, as disclosed in application Ser. No. 60/311,163, filed Aug. 9, 2001 and Ser. No. 10/215,446 filed Aug. 9, 2002.

[0009] RANKL (OPGL) has been identified as a potent inducer of bone resorption and as a positive regulator of osteoclast development. Lacey et al., supra. In addition to its role as a factor in osteoclast differentiation and activation, RANKL has been reported to induce human dendritic cell (DC) cluster formation. Anderson et al., supra and mammary epithelium development J. Fata et al., “The osteoclast differentiation factor osteoprotegerin ligand is essential for mammary gland development,” Cell, 103:41-50 (2000). Recently, we have determined that RANKL plays a role in anabolic bone formation processes and can be utilized in methods for stimulation of osteoblast proliferation or bone nodule mineralization, as disclosed in applications Ser. No. 60/277,855, filed Mar. 22, 2001 and Ser. No. 10,105,057 filed Mar. 22, 2002. We have also recently determined the precise three-dimensional structure of RANKL's ectodomain, and located the amino acid sequences of RANKL's unique external surface loops which interact with RANK, as disclosed in applications Ser No. 60/311, 163, filed Aug. 9, 2001 and Ser. No. 10/215,446 filed Aug. 9, 2002.

[0010] Accordingly, due to the limited success in treatment of bone loss disorders with current medications, a need exists to develop novel therapeutics for treatment of such conditions.

SUMMARY OF THE INVENTION

[0011] Accordingly, among the objects of the invention is the provision of polypeptides comprising one or more of the external surface loops AA″, CD, DE, or EF of RANKL and having the ability to bind to RANK. The RANKL loops correspond to the portions of RANKL molecule (SEQ ID NO 6) described below:

[0012] AA″ contains amino acid residues 170-193 (SEQ ID NO 2),

[0013] CD contains amino acid residues 224-233 (SEQ ID NO 3),

[0014] DE contains amino acid residues 245-251 (SEQ ID NO 4), and

[0015] EF contains amino acid residues 261-269 (SEQ ID NO 5).

[0016] A polypeptide containing a portion of AA″ loop sequence is also included in the invention, and it includes amino acid residues 175-185 (SEQ ID NO 1). Polypeptides of SEQ ID NO: 7 and SEQ ID NO: 11 are natural occurring variants of the AA″ loop of human RANKL. Both are natural variants of human RANKL. SEQ ID NO: 8, SEQ ID NO: 9 and SEQ ID NO: 10 are surface loop polypeptide sequences of human RANKL loops CD, DE and EF respectively.

[0017] In another aspect, the invention encompasses polypeptides that bind to RANK and consist of sequences selected from the group consisting of SEQ ID NO 1-SEQ ID NO 5 and SEQ ID NO: 7-SEQ ID NO: 11.

[0018] It is another object of the invention to provide polypeptides comprising external surface loops of RANKL and having the ability to competitively inhibit RANKL.

[0019] It is a further object of the invention to provide fragments, analogs, and derivatives of such polypeptides. The methods for obtaining such fragments, analogs, and derivatives are described herein.

[0020] In another aspect, the present invention provides pharmaceutical compositions comprising the polypeptides. The compositions may further include pharmaceutically acceptable carriers, adjuvants, solubilizers, stabilizers, and/or anti-oxidants.

[0021] The invention also encompasses methods for inhibiting osteoclast differentiation, methods for competitively inhibiting RANKL, and methods for inhibiting bone resorption. Such methods include administration of compositions of the invention. Further provided are methods for treating diseases and conditions which are at least in part characterized by loss of bone mass. In a preferred embodiment, such methods are used to treat osteoporosis, osteolytic bone disease, rheumatoid arthritis, and skeletal metastasis.

[0022] Other objects and features will be in part apparent and in part pointed out hereinafter.

BRIEF DESCRIPTION OF FIGURES

[0023]FIG. 1 is a graph depicting the effect of increasing concentration of PSGSHKVTLSS peptide (SEQ ID NO 1) on osteoclast generation as measured by TRAP activity.

[0024] Abbreviations and Definitions

[0025] To facilitate understanding of the invention, a number of terms are defined below:

[0026] The amino acid notations used herein for the twenty genetically encoded L-amino acids are conventional and are abbreviated as follows: One Letter Three-Letter Amino Acid Symbol Symbol Alanine A Ala Arginine R Arg Asparagine N Asn Aspartic D Asp acid Cysteine C Cys Glutamine Q Gln Glutamicacid E Glu Glycine G Gly Histidine H His Isoleucine I Ile Leucine L Leu Lysine K Lys Methionine M Met Phenylalanine F Phe Proline P Pro Serine S Ser Threonine T Thr Tryptophan W Trp Tyrosine Y Tyr Valine V Val

[0027] As used herein, unless specifically delineated otherwise, the three-letter and one-letter amino acid abbreviations designate amino acids in either the D-configuration or the L-configuration. For example, Arg designates D-arginine and L-arginine, and R designates D-arginine and L-arginine.

[0028] Unless noted otherwise, when polypeptide sequences are presented as a series of one-letter and/or three-letter abbreviations, the sequences are presented in the N→C direction, in accordance with common practice. As used herein, “C” refers to the alpha carbon of an amino acid residue. For purposes of determining conservative amino acid substitutions in the various polypeptides described herein and for describing the various peptide and peptide analog compounds, the amino acids can be conveniently classified into two main categories—hydrophilic and hydrophobic—depending primarily on the physical-chemical characteristics of the amino acid side chain. These two main categories can be further classified into subcategories that more distinctly define the characteristics of the amino acid side chains. For example, the class of hydrophilic amino acids can be further subdivided into acidic, basic and polar amino acids. The class of hydrophobic amino acids can be further subdivided into non-polar and aromatic amino acids. The definitions of the various categories of amino acids are as follows:

[0029] “Hydrophilic amino acid” refers to an amino acid exhibiting a hydrophobicity of less than zero according to the normalized consensus hydrophobicity scale of Eisenberg et al., 1984, J. Mol. Biol. 179:125-142. Genetically encoded hydrophilic amino acids include Thr (T), Ser (S), His (H), Glu (E), Asn (N), Gln (Q), Asp (D), Lys (K) and Arg (R).

[0030] “Acidic amino acid” refers to a hydrophilic amino acid having a side chain pK value of less than 7. Acidic amino acids typically have negatively charged side chains at physiological pH due to loss of a hydrogen ion. Genetically encoded acidic amino acids include Glu (E) and Asp (D).

[0031] “Basic amino acid” refers to a hydrophilic amino acid having a side chain pK value of greater than 7. Basic amino acids typically have positively charged side chains at physiological pH due to association with hydronium ion. Genetically encoded basic amino acids include His (H), Arg (R) and Lys (K).

[0032] “Polar amino acid” refers to a hydrophilic amino acid having a side chain that is uncharged at physiological pH, but which has at least one bond in which the pair of electrons shared in common by two atoms is held more closely by one of the atoms. Genetically encoded polar amino acids include Asn (N), Gln (Q) Ser (S) and Thr (T).

[0033] “Hydrophobic amino acid” refers to an amino acid exhibiting a hydrophobicity of greater than zero according to the normalized consensus hydrophobicity scale of Eisenberg, 1984, J. Mol. Biol. 179:125-142. Genetically encoded hydrophobic amino acids include Pro (P), Lle (I), Phe (F), Val (V), Leu (L), Trp (W), Met (M), Ala (A), Gly (G) and Tyr (Y).

[0034] “Aromatic amino acid” refers to a hydrophobic amino acid with a side chain having at least one aromatic or heteroaromatic ring. The aromatic or heteroaromatic ring may contain one or more substituents such as —OH, —SH, —CN, —F, —Cl, —Br, —I, —NO₂, —NO, —NH₂, —NHR, —NRR, —C(O)R, —C(O)OH, —C(O)OR, —C(O)NH₂, —C(O)NHR, —C(O)NRR and the like where each R is independently (C₁-C₆) alkyl, substituted (C₁-C₆) alkyl, (C₂-C₆) alkenyl, substituted (C₂-C₆) alkenyl, (C₂-C₆) alkynyl, substituted (C₂ C₆) alkynyl, (C₅-C₂₀) aryl, substituted (C₅-C₂₀) aryl, (C₆-C₂₆) arylalkyl, substituted (C₆-C₂₆) arylalkyl, 5-20 membered heteroaryl, substituted 5-20 membered heteroaryl, 6-26 membered heteroarylalkyl or substituted 6-26 membered heteroarylalkyl. Genetically encoded aromatic amino acids include His (H), Phe (F), Tyr (Y) and Trp (W).

[0035] “Apolar amino acid” refers to a hydrophobic amino acid having a side chain that is uncharged at physiological pH and which has bonds in which the pair of electrons shared in common by two atoms is generally held equally by each of the two atoms (i.e., the side chain is not polar). Genetically encoded apolar amino acids include Leu (L), Val (V), Ile (I), Met (M), Gly (G) and Ala (A).

[0036] “Aliphatic amino acid” refers to a hydrophobic amino acid having an aliphatic hydrocarbon side chain. Genetically encoded aliphatic amino acids include Ala (A), Val (V), Leu (L) and Ile (I).

[0037] “Hydroxyl-substituted aliphatic amino acid” refers to a hydrophilic polar amino acid having a hydroxyl-substituted side chain. Genetically-encoded hydroxyl-substituted aliphatic amino acids include Ser (S) and Thr (T).

[0038] The amino acid residue Cys (C) is unusual in that it can form disulfide bridges with other Cys (C) residues or other sulfanyl-containing amino acids. The ability of Cys (C) residues (and other amino acids with —SH containing side chains) to exist in a peptide in either the reduced free —SH or oxidized disulfide-bridged form affects whether Cys (C) residues contribute net hydrophobic or hydrophilic character to a peptide. While Cys (C) exhibits a hydrophobicity of 0.29 according to the normalized consensus scale of Eisenberg (Eisenberg, 1984, supra), it is to be understood that for purposes of the present invention Cys (C) is categorized as a polar hydrophilic amino acid, notwithstanding the general classifications defined above.

[0039] As will be appreciated by those of skill in the art, the above-defined categories are not mutually exclusive. Thus, amino acids having side chains exhibiting two or more physical-chemical properties can be included in multiple categories. For example, amino acid side chains having aromatic moieties that are further substituted with polar substituents, such as Tyr (Y), may exhibit both aromatic hydrophobic properties and polar or hydrophilic properties, and can therefore be included in both the aromatic and polar categories. As another example, His (H) has a side chain that falls within the aromatic and basic categories. The appropriate categorization of any amino acid will be apparent to those of skill in the art, especially in light of the detailed disclosure provided herein.

[0040] While the above-defined categories have been exemplified in terms of the genetically encoded amino acids, the amino acid substitutions need not be, and in certain embodiments preferably are not, restricted to the genetically encoded amino acids. Indeed, since many of the compounds described herein may be produced synthetically, they may comprise one or more genetically non-encoded amino acids. Thus, in addition to the naturally occurring genetically encoded amino acids, amino acid residues in the core peptides of structure (1) may be substituted with naturally occurring non-encoded amino acids and synthetic amino acids.

[0041] Certain commonly encountered amino acids of which the compounds of the invention may be comprised include, but are not limited to, β-alanine (β-Ala) and other omega-amino acids such as 3-aminopropionic acid, 2,3-diaminopropionic acid (Dpr), 4-aminobutyric acid and so forth; α-aminoisobutyric acid (Aib); ε-aminohexanoic acid (Aha); δ-aminovaleric acid (Ava); N-methylglycine or sarcosine (MeGly); ornithine (Orn); citrulline (Cit); t-butylalanine (t-BuA); t-butylglycine (t-BuG); N-methylisoleucine (MeIle); phenylglycine (Phg); cyclohexylalanine (Cha); norleucine (Nle); naphthylalanine (Nal); 4-chlorophenylalanine (Phe(4-Cl)); 2-fluorophenylalanine (Phe(2-F)); 3-fluorophenylalanine (Phe(3-F)); 4-fluorophenylalanine (Phe(4-F)); penicillamine (Pen); 1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid (Tic);β-2-thienylalanine (Thi); methionine sulfoxide (MSO); homoarginine (hArg); N-acetyl lysine (AcLys); 2,4-diaminobutyric acid (Dbu); 2,3-diaminobutyric acid (Dab); p-aminophenylalanine (Phe(pNH₂)); N-methyl valine (MeVal); homocysteine (hCys), homophenylalanine (hPhe) and homoserine (hSer); hydroxyproline (Hyp), homoproline (hPro), N-methylated amino acids and peptoids (N-substituted glycines).

[0042] The classifications of the genetically encoded and common non-encoded amino acids according to the categories defined above are summarized in Table 3, below. It is to be understood that Table 3 is for illustrative purposes only and does not purport to be an exhaustive list of amino acid residues that can be used in the invention. Additional amino acids may be found in Fasman, 1989, Practical Handbook of Biochemistry and Molecular Biology, CRC Press, Inc., pp. 3-70, and the references cited therein.

[0043] As used herein, a “recombinant nucleic acid” is defined either by its method of production or its structure. In reference to its method of production, e.g., a product made by a process, the process is use of recombinant nucleic acid techniques, e.g., involving human intervention in the nucleotide sequence, typically selection or production. Alternatively, it can be a nucleic acid made by generating a sequence comprising fusion of two fragments which are not naturally contiguous to each other, but is meant to exclude products of nature, e.g., naturally occurring mutants. Thus, for example, products made by transforming cells with any unnaturally occurring vector is encompassed, as are nucleic acids comprising sequences derived using any synthetic oligonucleotide process. Such is often done to replace a codon with a redundant codon encoding the same or a conservative amino acid, while typically introducing or removing a sequence recognition site. Alternatively, it is performed to join together nucleic acid segments of desired functions to generate a single genetic entity comprising a desired combination of functions not found in the commonly available natural forms. Restriction enzyme recognition sites are often the target of such artificial manipulations, but other site specific targets, e.g., promoters, DNA replication sites, regulation sequences, control sequences, or other useful features may be incorporated by design.

[0044] As used herein, “polynucleotide” and “oligonucleotide” are used interchangeably and mean a polymer of at least 2 nucleotides joined together by phosphodiester bonds and may consist of either ribonucleotides or deoxyribonucleotides.

[0045] As used herein, “sequence” means the linear order in which monomers occur in a polymer, for example, the order of amino acids in a polypeptide or the order of nucleotides in a polynucleotide.

[0046] As used herein, “peptide”, “polypeptide” and “protein” are used interchangeably and mean a compound that consists of two or more amino acids that are linked by means of peptide bonds.

[0047] As used herein “recombinant protein” means that the protein, whether comprising a native or mutant primary amino acid sequence, is obtained by expression of a gene carried by a recombinant DNA molecule in a cell other than the cell in which that gene and/or protein is naturally found. In other words, the gene is heterologous to the host in which it is expressed. It should be noted that any alteration of a gene, including the addition of a polynucleotide encoding an affinity purification moiety to the gene, makes that gene unnatural for the purposes of this definition, and thus that gene cannot be ‘naturally’ found in any cell.

[0048] As used herein, “mutein” includes fragments, derivatives, and analogs of polypeptides.

[0049] As used herein, “RANK” refers to RANK protein, recombinant RANK proteins, RANK fusion proteins, analogs, derivatives and mimics thereof.

[0050] As used herein, the term “animal” includes human beings.

[0051] The phrase “preventing or inhibiting” is being affected either by being inhibited, expressed in another manner, or reduced to such an extent that the observed property is measurably lower than is the case when the treatment is not employed. Measurement of the degree of inhibition can be determined in vitro by methods known to the person skilled in the art.

[0052] By the term “an effective amount” is meant an amount of the substance in question which produces a statistically significant effect. For example, an “effective amount” for therapeutic uses is the amount of the composition comprising an active compound herein required to provide a clinically significant increase in healing rates in fracture repair; reversal or inhibition of bone loss in osteoporosis; prevention or delay of onset of osteoporosis; repair or prevention of dental defects; or treatment or inhibition of other bone loss conditions, diseases or defects, including but not limited to those discussed herein above. Such effective amounts will be determined using routine optimization techniques and are dependent on the particular condition to be treated, the condition of the patient, the route of administration, the formulation, and the judgment of the practitioner and other factors evident to those skilled in the art. The dosage required for the compounds of the invention (for example, in osteoporosis) is manifested as that which induces a statistically significant difference in bone mass between treatment and control groups. This difference in bone mass may be seen, for example, as at least 1-2%, or any clinically significant enhancement in bone mass for the treatment group. Other measurements of clinically significant increases in healing may include, for example, tests for breaking strength and tension, breaking strength and torsion, 4-point bending, and other biomechanical tests well known to those skilled in the art. General guidance for treatment regimens is obtained from the experiments carried out in animal models of the disease of interest.

[0053] As used herein, “treatment” includes both prophylaxis and therapy. Thus, in treating a subject, the compounds of the invention may be administered to a subject already suffering from loss of bone mass or to prevent or inhibit the occurrence of such condition.

DETAILED DESCRIPTION

[0054] In accordance with the present invention, applicants have discovered that a polypeptide whose sequence represents a subset of the AA″ loop of RANKL acts as a competitive antagonist of RANKL by preventing RANKL from inducing cellular differentiation of osteoclast precursors. Furthermore, the observed inhibition of osteoclast differentiation was dose-dependent.

[0055] The U.S. applications Ser. No. 60/311,163 and No. 10/215,446, filed Aug. 9, 2001 and 2002, respectively, disclose the identity of RANKL surfaces that are responsible for binding to RANK, and these include external surface loops AA″, CD, DE, and EF. The external (solvent-accessible) surface loops of RANKL are unique within the TNF family, displaying markedly divergent lengths and conformations: the AA″ loop (residues 170-193 of RANKL protein) bridges strands A and A′, the CD loop (residues 224-233) connects strands C and D, the EF loop (residues 261-269) links strands E and F, and the loop DE (residues 245-251) connects strands D and E. RANKL possesses a longer AA″ loop and a shorter EF loop than the typical TNF family member. The AA″ loop, together with the displacement of the CD loop confers a unique surface to the upper third of the RANKL molecule, whereas a subtle shift of the DE loop shapes the receptor binding groove at the base of RANKL molecule. For a detailed description of RANKL loops and binding specificity of RANK/RANKL interaction see U.S. applications Ser. No. 60/311,163 and No. 10/215,446, filed Aug. 9, 2001 and 2002, respectively.

[0056] Thus, applicants have contemplated the use of polypeptides containing RANKL external surface loop sequences and muteins thereof for inhibiting osteoclast differentiation. Consequently, such polypeptides may be used to treat diseases or conditions manifested at least in part by loss of bone mass.

[0057] Accordingly, the present invention provides polypeptides comprising one or more of external surface loops AA″, CD, DE, or EF of RANKL and having the ability to bind to RANK. The RANKL loops correspond to the portions of RANKL molecule (SEQ ID NO 6) described below:

[0058] AA″ contains amino acid residues 170-193 (SEQ ID NO 2),

[0059] CD contains amino acid residues 224-233 (SEQ ID NO 3),

[0060] DE contains amino acid residues 245-251 (SEQ ID NO 4), and

[0061] EF contains amino acid residues 261-269 (SEQ ID NO 5).

[0062] The invention is illustrated by a polypeptide containing a portion of the AA″ loop sequence which includes amino acid residues 175-185 (SEQ ID NO 1).

[0063] In one aspect, the RANKL loops correspond to the portions of human RANKL including polypeptides of SEQ ID NO: 7 and SEQ ID NO: 11 which are natural occurring variants of the AA″ loop of human RANKL. Both are natural variants of human RANKL. SEQ ID NO: 8, SEQ ID NO: 9 and SEQ ID NO: 10 are surface loop polypeptide sequences of human RANKL loops CD, DE and EF respectively.

[0064] In another aspect, the invention encompasses polypeptides that bind to RANK and consist of sequences selected from the group consisting of SEQ ID NO 1-SEQ ID NO 5 and SEQ ID NO: 7-SEQ ID NO: 11. In another aspect, the invention encompasses concatemers of one or more polypeptides selected from SEQ ID NO 1-SEQ ID NO 5 and SEQ ID NO: 7-SEQ ID NO: 11. In another aspect, the invention encompasses therapeutic combinations of said polypeptides. The invention also provides polypeptides comprising RANKL external surface loops and having the ability to competitively inhibit RANKL.

[0065] It will be appreciated that by virtue of the present invention, the above-described polypeptides can be synthesized using conventional synthesis procedures commonly used by one skilled in the art. For example, the polypeptides can be chemically synthesized using an automated peptide synthesizer (such as one manufactured by Pharmacia LKB Biotechnology Co., LKB Biolynk 4170 or Milligen, Model 9050 (Milligen, Millford, Mass.)) following the method of Sheppard, et al., Journal of Chemical Society Perkin I, p. 538 (1981). In this procedure, N,N′-dicyclohexylcarbodiimide is added to amino acids whose amine functional groups are protected by 9-flourenylmethoxycarbonyl (Fmoc) groups and anhydrides of the desired amino acids are produced. These Fmoc-amino acid anhydrides can then be used for peptide synthesis. A Fmoc-amino acid anhydride corresponding to the C-terminal amino acid residue is fixed to Ultrosyn A resin through the carboxyl group using dimethylaminopyridine as a catalyst. Next, the resin is washed with dimethylformamide containing piperidine, and the protecting group of the amino functional group of the C-terminal acid is removed. The next amino acid corresponding to the desired peptide is coupled to the C-terminal amino acid. The deprotecting process is then repeated. Successive desired amino acids are fixed in the same manner until the peptide chain of the desired sequence is formed. The protective groups other than the acetoamidomethyl are then removed and the peptide is released with solvent.

[0066] Alternatively, the polypeptides can be synthesized by using nucleic acid molecules which encode the polypeptides of this invention in an appropriate expression vector which include the encoding nucleotide sequences. Such DNA molecules may be prepared using an automated DNA sequencer and the well-known codon-amino acid relationship of the genetic code. Such a DNA molecule also may be obtained as genomic DNA or as cDNA using oligonucleotide probes and conventional hybridization methodologies. Such DNA molecules may be incorporated into expression vectors, including plasmids, which are adapted for the expression of the DNA and production of the polypeptide in a suitable host such as bacterium, e.g., Escherichia coli, yeast cell, mammalian cell, or insect cell. Mammalian expression systems may facilitate glycosylation that may improve pharmaceutical and/or immunologic properties of the compound.

[0067] Another aspect of the invention is to provide fragments, analogs, and derivatives of the polypeptides of the present invention. The terms “fragment,” “derivative” and “analog” as used herein refer to compounds modified in such manner as to retain the ability to bind RANK. Thus, a fragment may be any suitable portion of the peptide of the present invention so long as the RANK binding functionality is retained by the fragment. Modifications may be achieved by any of the techniques known in the art for derivatization of polypeptides into fragments, analogs, or derivatives thereof. Such terms and in particular, “analog”, also specifically include peptide, non-peptide, small molecules and other compounds that function as RANKL mimics that bind RANKL.

[0068] Those of ordinary skill in the art are aware that modifications in the amino acid sequence of a peptide, polypeptide, or protein can result in equivalent, or possibly improved, second generation peptides, etc., that display equivalent or superior functional characteristics when compared to the original amino acid sequence. The present invention accordingly encompasses such modified amino acid sequences. Alterations can include but are not limited to amino acid insertions, deletions, substitutions, truncations, fusions, cyclization, disulfide bridging, shuffling of subunit sequences, and the like, provided that the peptide sequences produced by such modifications retain the ability to bind RANK. Such modifications may be undertaken to improve compound half-life, biological activity, absorption, distribution, metabolism, excretion, toxicity or the like. One factor that can be considered in making such changes is the hydropathic index of amino acids. The importance of the hydropathic amino acid index in conferring interactive biological function on a protein has been discussed by Kyte and Doolittle (J. Mol. Biol., 157: 105-132, 1982). It is accepted that the relative hydropathic character of amino acids contributes to the secondary structure of the resultant protein. This, in turn, affects the interaction of the protein with molecules such as enzymes, substrates, receptors, DNA, antibodies, antigens, etc.

[0069] Based on its hydrophobicity and charge characteristics, each amino acid has been assigned a hydropathic index as follows: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (−0.4); threonine (−0.7); serine (−0.8); tryptophan (−0.9); tyrosine (−1.3); proline (−1.6); histidine (−3.2); glutamate/glutamine/aspartate/asparagine (−3.5); lysine (−3.9); and arginine (−4.5).

[0070] As is known in the art, certain amino acids in a peptide or protein can be substituted for other amino acids having a similar hydropathic index or score and produce a resultant peptide or protein having similar biological activity, i.e., which still retains biological functionality. In making such changes, it is preferable that amino acids having hydropathic indices within ±2 are substituted for one another. More preferred substitutions are those wherein the amino acids have hydropathic indices within ±1. Most preferred substitutions are those wherein the amino acids have hydropathic indices within ±0.5.

[0071] Like amino acids can also be substituted on the basis of hydrophilicity. U.S. Pat. No. 4,554,101 discloses that the greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein. The following hydrophilicity values have been assigned to amino acids: arginine/lysine (+3.0); aspartate/glutamate (+3.0±1); serine (+0.3); asparagine/glutamine (+0.2); glycine (0); threonine (−0.4); proline (−0.5±1); alanine/histidine (−0.5); cysteine (−1.0); methionine (−1.3); valine (−1.5); leucine/isoleucine (−1.8); tyrosine (−2.3); phenylalanine (−2.5); and tryptophan (−3.4). Thus, one amino acid in a peptide, polypeptide, or protein can be substituted by another amino acid having a similar hydrophilicity score and still produce a resultant protein having similar biological activity, i.e., still retaining correct biological function. In making such changes, amino acids having hydropathic indices within ±2 are preferably substituted for one another, those within ±1 are more preferred, and those within ±0.5 are most preferred.

[0072] As outlined above, amino acid substitutions in the polypeptides of the present invention can be based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size, etc. Exemplary substitutions that take various of the foregoing characteristics into consideration in order to produce conservative amino acid changes resulting in silent changes within the present polypeptides, etc., can be selected from other members of the class to which the naturally occurring amino acid belongs. Amino acids can be divided into the following four groups: (1) acidic amino acids; (2) basic amino acids; (3) neutral polar amino acids; and (4) neutral non-polar amino acids. Representative amino acids within these various groups include, but are not limited to: (1) acidic (negatively charged) amino acids such as aspartic acid and glutamic acid; (2) basic (positively charged) amino acids such as arginine, histidine, and lysine; (3) neutral polar amino acids such as glycine, serine, threonine, cysteine, cystine, tyrosine, asparagine, and glutamine; and (4) neutral non-polar amino acids such as alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine. It should be noted that changes which are not expected to be advantageous can also be useful if these result in the production of functional sequences.

[0073] Thus, the fragment, derivative or analog of the polypeptides of the present invention may be, for example and without limitation, (i) one in which one or more amino acid residues are substituted with a conserved or non-conserved amino acid residue, and such substituted amino acid residue may or may not be one encoded by the genetic code; (ii) one in which one or more of the amino acid residues includes a substituent group; (iii) one in which the mature protein is fused to another compound such as a compound to increase the half-life of the protein; (iv) one in which additional amino acids are fused to the protein to aid in purification or in detection and identification; or (v) one in which additional amino acid residues are fused to the protein to aid in modifying tissue distribution or localization of the protein to certain locations such as the cell membrane or extracellular compartments; or (vi) one in which another molecule, possibly a small, non-peptide molecule, mimics the RANK-binding functionality of the polypeptide.

[0074] It is standard practice in the art to modify polypeptides to improve their potential as pharmacological agents. Thus, modifications can be made without completely affecting a polypeptide's binding to RANK and/or its ability to inhibit osteoclast differentiation. For example, oligomerization of RANKL has been shown to be a useful technique for delaying internalization of the protein. In addition, cyclization may be used to stabilize the RANKL mimics of the present invention. Similarly, known treatments to increase the stability or other beneficial characteristic of the polypeptide, such as substitution of L- with D-amino acids, or PEG-alation, may be utilized to like effect with the RANKL mimics of the present invention.

[0075] In one aspect, the polypeptide sequence may be flanked at either of both of its N- and/or C-termini by residues. When included, such flanking residues should not significantly alter the ability of the core sequence to bind to RANK and inhibit RANK/RANKL interaction. Flanking residues may include Cysteines to facilitate disulfide bridging. Thus, in an embodiment, the polypeptides of the invention may include flanking residues at each terminus that may be fewer than 5 residues each. In a preferred embodiment, fewer than 3 flanking residues at each terminus, and most preferably no flanking residues.

[0076] Such molecules may contain a number of modifications known to those skilled in the art. For instance, substituted amide linkages may generally include, but are not limited to, groups of the formula —C(O)N(R)—, where R is (C₁-C₆) alkyl, substituted (C₁-C₆) alkyl, (C₂-C₆) alkenyl, substituted (C₂-C₆) alkenyl, (C₂-C₆) alkynyl, substituted (C₂-C₆) alkynyl, (C₅-C₂₀) aryl, substituted (C₅-C₂₀) aryl, (C₆-C₂₆) arylalkyl, substituted (C₆-C₂₆) arylalkyl, 5-20 membered heteroaryl, substituted 5-20 membered heteroaryl, 6-26 membered heteroarylalkyl and substituted 6-26 membered heteroarylalkyl.

[0077] Isosteres of amide linkages may generally include, but are not limited to, —CH₂NH—, —CH₂S—, —CH₂CH₂—, —CH═CH— (cis and trans), —C(O)CH₂—, —CH(OH)CH₂— and —CH₂SO—. Compounds having such non-amide linkages and methods for preparing such compounds are well-known in the art (see, e.g., Spatola, March 1983, Vega Data Vol. 1, Issue 3; Spatola, 1983, “Peptide Backbone Modifications” In: Chemistry and Biochemistry of Amino Acids Peptides and Proteins, Weinstein, ed., Marcel Dekker, New York, p. 267 (general review); Morley, 1980, Trends Pharm. Sci. 1:463-468; Hudson et al., 1979, Int. J. Prot. Res. 14:177-185 (—CH₂NH—, —CH₂CH₂—); Spatola et al., 1986, Life Sci. 38:1243-1249 (—CH₂—S); Hann, 1982, J. Chem. Soc. Perkin Trans. I. 1:307-314 (—CH═CH—, cis and trans); Almquist et al., 1980, J. Med. Chem. 23:1392-1398 (—COCH₂—); Jennings-White et al., Tetrahedron. Lett. 23:2533 (—COCH₂—); European Patent Application EP 45665 (1982) CA 97:39405 (—CH(OH)CH₂—); Holladay et al., 1983, Tetrahedron Lett. 24:4401-4404 (—C(OH)CH₂—); and Hruby, 1982, Life Sci. 31:189-199 (—CH₂—S—).

[0078] Additionally, one or more amide linkages can be replaced with peptidomimetic or amide mimetic moieties which do not significantly interfere with the structure or activity of the peptides. Alternatively, all of amide linkages may be replaced with peptidomimetic moieties. Suitable amide mimetic moieties are described, for example, in Olson et al., 1993, J. Med. Chem. 36:3039-3049.

[0079] The peptides and peptide analogs may optionally include a peptide or peptide analog at either or both termini that may be 1 to 5 residues or more in length. Peptide analogs typically contain at least one modified interlinkage, such as a substituted amide or an isostere of an amide, as described above. Such additional peptides or peptide analogs may have an amino acid sequence derived from another portion of the RANKL amino acid sequence or, alternatively, their sequences may be completely random. Peptides including such random sequences may be tested for biological activity, i.e. their ability to bind to RANK.

[0080] One method of testing compound binding to RANK is determined by performing an assay such as, e.g., a binding assay between a desired compound and RANK. In one aspect, this is done by contacting said compound to RANK and determining its dissociation rate. Numerous possibilities for performing binding assays are well known in the art. The indication of a compound's ability to bind to RANK is determined, e.g., by a dissociation rate, and the correlation of binding activity and dissociation rates is well established in the art.

[0081] For example, the assay may be performed by radio-labeling a reference compound, peptide, or protein such as RANKL or isolated external surface loops therefrom, e.g. with ¹²⁵l and incubating it with RANK in 1.5 ml tubes. Test compound s are then added to these reactions in increasing concentrations. After optimal incubation, the RANK/compound complexes are separated, e.g., with chromatography columns, and evaluated for bound ¹²⁵l-labeled peptide with γ counter. The amount of the test compound necessary to inhibit 50% of the reference peptide's binding is determined. These values are then normalized to the concentration of unlabeled reference peptide's binding (relative inhibitory concentration (RIC)⁻¹=concentration_(test)/concentration_(reference)). A small RIC⁻¹ value indicates strong relative binding, whereas a large RIC⁻¹ value indicates weak relative binding. See, for example, Latek et al., Proc. Natl. Acad. Sci. USA, Vol. 97, No. 21, pp. 11460-11465, 2000. Of course, high throughput binding assays such as those offered commercially by PerkinElmer, Actelion and others, are also suitable for testing compound binding.

[0082] Also included within the scope of the present invention are “blocked” forms of the peptides and peptide analogs in which the N- and/or C-terminus is blocked with a moiety capable of reacting with the N-terminal —NH₂ or C-terminal —C(O)OH. Such blocked compounds are typically N-terminal acylated and/or C-terminal amidated or esterified. Typical N-terminal blocking groups include R¹C(O)—, where R¹ is hydrogen, (C₁-C₆) alkyl, (C₂-C₆) alkenyl, (C₂-C₆) alkynyl, (C₅-C₂₀) aryl, (C₆-C₂₆) arylalkyl, 5-20 membered heteroaryl or 6-26 membered heteroarylalkyl. Preferred N-terminal blocking groups include acetyl, formyl and dansyl. Typical C-terminal blocking groups include —C(O)NR¹R¹ and —C(O)OR¹, where each R¹ is independently as defined above. Preferred C-terminal blocking groups include those in which each R¹ is independently (C₁-C₆) alkyl, preferably methyl, ethyl, propyl or isopropyl.

[0083] In a preferred embodiment of the invention, a method of preventing or inhibiting bone loss, a method of inhibiting osteoclast differentiation, and a method of competitively inhibiting RANKL activity are provided by administering compositions comprising compounds identified by the screening methods described herein. The bone forming compositions of the present invention may be utilized by providing an effective amount of such compositions to a subject in need thereof. The methods and compositions may be used to treat many diseases or conditions characterized by bone loss or thinning. Such diseases and conditions include osteoporosis, juvenile osteoporosis, osteogenesis imperfecta, hypercalcemia, hyperparathyroidism, osteomalacia, osteohalisteresis, osteolytic bone disease, osteonecrosis, Paget's disease of bone, bone loss due to rheumatoid arthritis, inflammatory arthritis, osteomyelitis, corticosteroid treatment, metastatic bone diseases, periodontal bone loss, bone loss due to cancer, age-related loss of bone mass, and other forms of osteopenia. In a preferred embodiment, the methods and compositions of the invention are used to treat osteoporosis, osteolytic bone disease, bone loss due to rheumatoid arthritis, and skeletal metastasis.

[0084] For use for treatment of animal subjects, the compositions of the invention can be formulated as pharmaceutical or veterinary compositions. Depending on the subject to be treated, the mode of administration, and the type of treatment desired, e.g., prevention, prophylaxis, therapy; the compositions are formulated in ways consonant with these parameters. A summary of such techniques is found in Remington's Pharmaceutical Sciences, latest edition, Mack Publishing Co., Easton, Pa.

[0085] The administration of the compositions of the present invention may be pharmacokinetically and pharmacodynamically controlled by calibrating various parameters of administration, including the frequency, dosage, duration mode and route of administration. Variations in the dosage, duration and mode of administration may also be manipulated to produce the activity required.

[0086] For administration to animal or human subjects, the dosage of the compounds of the invention is typically 0.01-100 mg/kg. However, dosage levels are highly dependent on the nature of the disease or situation, the condition of the patient, the judgment of the practitioner, and the frequency and mode of administration. If the oral route is employed, the absorption of the substance will be a factor effecting bioavailability. A low absorption will have the effect that in the gastro-intestinal tract higher concentrations, and thus higher dosages, will be necessary.

[0087] It will be understood that the appropriate dosage of the substance should suitably be assessed by performing animal model tests, wherein the effective dose level (e.g. ED₅₀) and the toxic dose level (e.g. TD₅₀) as well as the lethal dose level (e.g. LD₅₀ or LD₁₀) are established in suitable and acceptable animal models. Further, if a substance has proven efficient in such animal tests, controlled clinical trials should be performed.

[0088] In general, for use in treatment, the compounds of the invention may be used alone or in combination with other compositions for the treatment of bone loss. Such compositions include anti-resorptives such as a bisphosphonate, a calcitonin, a calcitriol, an estrogen, SERM's and a calcium source, or a supplemental bone formation agent like parathyroid hormone or its derivative, a bone morphogenetic protein, osteogenin, NaF, or a statin. See U.S. Pat. No. 6,080,779 incorporated herein by reference. Depending on the mode of administration, the compounds will be formulated into suitable compositions.

[0089] Formulations may be prepared in a manner suitable for systemic administration or for topical or local administration. Systemic formulations include, but are not limited to those designed for injection (e.g., intramuscular, intravenous or subcutaneous injection) or may be prepared for transdermal, transmucosal, nasal, or oral administration. The formulation will generally include a diluent as well as, in some cases, adjuvants, buffers, preservatives and the like.

[0090] For oral administration, the compositions can be administered also in liposomal compositions or as microemulsions. Suitable forms include syrups, capsules, tablets, as is understood in the art. For injection, formulations can be prepared in conventional forms as liquid solutions or suspensions or as solid forms suitable for solution or suspension in liquid prior to injection or as emulsions. Suitable excipients include, for example, water, saline, dextrose, glycerol and the like. Such compositions may also contain amounts of nontoxic auxiliary substances such as wetting or emulsifying agents, pH buffering agents and the like, such as, for example, sodium acetate, sorbitan monolaurate, and so forth.

[0091] The compositions of the present invention may also be administered locally to sites in patients, both human and other vertebrates, such as domestic animals, rodents and livestock, where decreased bone loss and/or increased bone mass are desired using a variety of techniques known to those skilled in the art. For example, these may include sprays, lotions, gels or other vehicles such as alcohols, polyglycols, esters, oils and silicones. Such local applications include, for example, at a site of a bone fracture or defect to repair or replace damaged bone. Additionally, an anti-resorptive agent may be administered e.g., in a suitable carrier, at a junction of an autograft, allograft or prosthesis and native bone to assist in binding of the graft or prosthesis to the native bone.

[0092] Another embodiment of the present invention involves use of the RANKL loops in competitive binding assays to screen for inhibitors of RANKL. Binding to RANK is determined by performing an assay as described above as a binding assay between a desired compound and RANK. In one aspect, this is done by contacting a test compound to RANK and determining its dissociation rate. Numerous possibilities for performing binding assays are well known in the art. The indication of a compound's ability to bind to RANK is determined, e.g., by a dissociation rate, and the correlation of binding activity and dissociation rates is well established in the art. For example, the assay may be performed by radio-labeling a reference compound, e.g. a polypeptide containing a portion of AA″ loop sequence SEQ ID NO 1 with ¹²⁵I and incubating it with RANK in 1.5 ml tubes. Test compounds are then added to these reactions in increasing concentrations. After optimal incubation, the RANK/compound complexes are separated, e.g., with chromatography columns, and evaluated for bound ¹²⁵I-labeled peptide with gamma (γ) counter. The amount of the test compound necessary to inhibit 50% of the reference compound's binding is determined. These values are then normalized to the concentration of unlabeled reference compound's binding (relative inhibitory concentration (RIC)⁻¹=concentration_(test)/concentration_(reference)). A small RIC⁻¹ value indicates strong relative binding, whereas a large RIC⁻¹ value indicates weak relative binding. See, for example, Latek et al., Proc. Natl. Acad. Sci. USA, Vol. 97, No. 21, pp. 11460-11465,2000. The RANKL loops identified in SEQ ID NO 2-5 and fragments thereof are also suitable for use individually or in combination as reference compounds in such assays. Again, high throughput assays are also suitable to perform the binding assays involving one or more of the RANKL loops.

[0093] Other features, objects and advantages of the present invention will be apparent to those skilled in the art. The explanations and illustrations presented herein are intended to acquaint others skilled in the art with the invention, its principles, and its practical application. Those skilled in the art may adapt and apply the invention in its numerous forms, as may be best suited to the requirements of a particular use. Accordingly, the specific embodiments of the present invention as set forth are not intended as being exhaustive or limiting of the present invention.

[0094] All publications and patent applications cited in this specification are herein incorporated by reference as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

[0095] The following examples illustrate the invention, but are not to be taken as limiting the various aspects of the invention so illustrated.

EXAMPLES Example 1

[0096] The effect of a partial AA″ loop polypeptide of RANKL on TRAP activity. Wild type C3H/HENJ mice were purchased from Harlan Industries (Indianapolis, Ind.). In order to establish a cell culture of osteoclast precursors, bone marrow macrophages (BMMs) were isolated from whole bone marrow of four to six week old mice and incubated in tissue culture dishes at 37° C. in 5% CO₂. After 24 hours in culture, the non-adherent cells were collected and layered on a Ficoll Hypaque gradient and the cells at the gradient interface were collected. Cells were replated at 65,000/cm² in α-minimal essential medium, supplemented with 10% heat inactivated fetal bovine serum, at 37° C. in 5% CO₂ in the presence of recombinant mouse M-CSF (10 ng.ml). The cell cultures were then treated either with the increasing concentrations of polypeptide of SEQ ID NO 1 (subset of AA″ loop) or with a negative control scrambled peptide having the same molecular weight as the test compound.

[0097] The results of the experiment are shown in FIG. 1. As can be seen from the same figure, addition of the test compound to the cell cultures of osteoclast precursors competitively inhibits the ability of RANKL to induce osteoclast differentiation as monitored by tartrate specific acid phosphatase (TRAP). This phosphatase is an osteoclast specific enzyme and its activity corresponds with osteoclast differentiation. TRAP activity is measured by quantitating its ability to cleave a substrate in a color-producing reaction.

[0098] As can also be seen from FIG. 1, the osteoclast differentiation inhibition is dose-dependent, i.e. increasing the concentration of the test compound in culture decreases TRAP activity, whereas the negative control peptide has no affect on TRAP activity.

0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 51 <210> SEQ ID NO 1 <211> LENGTH: 1823 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/AF013171 <309> DATABASE ENTRY DATE: 1997-09-19 <313> RELEVANT RESIDUES: (1)..(1823) <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/NM_033012.2 <309> DATABASE ENTRY DATE: 2002-07-31 <313> RELEVANT RESIDUES: (1)..(1823) <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/AF053712.1 <309> DATABASE ENTRY DATE: 1998-05-09 <313> RELEVANT RESIDUES: (1)..(1823) <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/AF019047.1 <309> DATABASE ENTRY DATE: 1997-11-22 <313> RELEVANT RESIDUES: (1)..(1823) <400> SEQUENCE: 1 cagatggatc ctaatagaat atcagaagat ggcactcact gcatttatag aattttgaga 60 ctccatgaaa atgcagattt tcaagacaca actctggaga gtcaagatac aaaattaata 120 cctgattcat gtaggagaat taaacaggcc tttcaaggag ctgtgcaaaa ggaattacaa 180 catatcgttg gatcacagca catcagagca gagaaagcga tggtggatgg ctcatggtta 240 gatctggcca agaggagcaa gcttgaagct cagccttttg ctcatctcac tattaatgcc 300 accgacatcc catctggttc ccataaagtg agtctgtcct cttggtacca tgatcggggg 360 tggggtaaga tctccaacat gacttttagc aatggaaaac taatagttaa tcaggatggc 420 ttttattacc tgtatgccaa catttgcttt cgacatcatg aaacttcagg agacctagct 480 acagagtatc ttcaactaat ggtgtacgtc actaaaacca gcatcaaaat cccaagttct 540 cataccctga tgaaaggagg aagcaccaag tattggtcag ggaattctga attccatttt 600 tattccataa acgttggtgg attttttaag ttacggtctg gagaggaaat cagcatcgag 660 gtctccaacc cctccttact ggatccggat caggatgcaa catactttgg ggcttttaaa 720 gttcgagata tagattgagc cccagttttt ggagtgttat gtatttcctg gatgtttgga 780 aacatttttt aaaacaagcc aagaaagatg tatataggtg tgtgagacta ctaagaggca 840 tggcccaacg gtacacgact cagtatccat gctcttgacc ttgtagagaa cacgcgtatt 900 tacagccagt gggagatgtt agactcatgg tgtgttacac aatggttttt aaattttgta 960 atgaattcct agaattaaac cagattggag caattacggg ttgaccttat gagaaactgc 1020 atgtgggcta tgggaggggt tggtccctgg tcatgtgccc cttcgcagct gaagtggaga 1080 gggtgtcatc tagcgcaatt gaaggatcat ctgaaggggc aaattctttt gaattgttac 1140 atcatgctgg aacctgcaaa aaatactttt tctaatgagg agagaaaata tatgtatttt 1200 tatataatat ctaaagttat atttcagatg taatgttttc tttgcaaagt attgtaaatt 1260 atatttgtgc tatagtattt gattcaaaat atttaaaaat gtcttgctgt tgacatattt 1320 aatgttttaa atgtacagac atatttaact ggtgcacttt gtaaattccc tggggaaaac 1380 ttgcagctaa ggaggggaaa aaatgttgtt tcctaatatc aaatgcagta tatttcttcg 1440 ttctttttaa gttaatagat tttttcagac ttgtcaagcc tgtgcaaaaa aattaaaatg 1500 gatgccttga ataataagca ggatgttggc caccaggtgc ctttcaaatt tagaaactaa 1560 ttgactttag aaagctgaca ttgccaaaaa ggatacataa tgggccactg aaatctgtca 1620 agagtagtta tataattgtt gaacaggtgt ttttccacaa gtgccgcaaa ttgtaccttt 1680 ttttgttttt ttcaaaatag aaaagttatt agtggtttat cagcaaaaaa gtccaatttt 1740 aatttagtaa atgttatctt atactgtaca ataaaaacat tgcctttgaa tgttaatttt 1800 ttggtacaaa agtcgacggc cgc 1823 <210> SEQ ID NO 2 <211> LENGTH: 1769 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/NM_003810.2 <309> DATABASE ENTRY DATE: 2002-10-07 <313> RELEVANT RESIDUES: (1)..(1769) <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/U57059.1 <309> DATABASE ENTRY DATE: 1999-03-04 <313> RELEVANT RESIDUES: (1)..(1769) <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/U37518.1 <309> DATABASE ENTRY DATE: 1996-01-06 <313> RELEVANT RESIDUES: (1)..(1769) <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/BC032722.1 <309> DATABASE ENTRY DATE: 2002-06-27 <313> RELEVANT RESIDUES: (1)..(1769) <400> SEQUENCE: 2 cctcactgac tataaaagaa tagagaagga agggcttcag tgaccggctg cctggctgac 60 ttacagcagt cagactctga caggatcatg gctatgatgg aggtccaggg gggacccagc 120 ctgggacaga cctgcgtgct gatcgtgatc ttcacagtgc tcctgcagtc tctctgtgtg 180 gctgtaactt acgtgtactt taccaacgag ctgaagcaga tgcaggacaa gtactccaaa 240 agtggcattg cttgtttctt aaaagaagat gacagttatt gggaccccaa tgacgaagag 300 agtatgaaca gcccctgctg gcaagtcaag tggcaactcc gtcagctcgt tagaaagatg 360 attttgagaa cctctgagga aaccatttct acagttcaag aaaagcaaca aaatatttct 420 cccctagtga gagaaagagg tcctcagaga gtagcagctc acataactgg gaccagagga 480 agaagcaaca cattgtcttc tccaaactcc aagaatgaaa aggctctggg ccgcaaaata 540 aactcctggg aatcatcaag gagtgggcat tcattcctga gcaacttgca cttgaggaat 600 ggtgaactgg tcatccatga aaaagggttt tactacatct attcccaaac atactttcga 660 tttcaggagg aaataaaaga aaacacaaag aacgacaaac aaatggtcca atatatttac 720 aaatacacaa gttatcctga ccctatattg ttgatgaaaa gtgctagaaa tagttgttgg 780 tctaaagatg cagaatatgg actctattcc atctatcaag ggggaatatt tgagcttaag 840 gaaaatgaca gaatttttgt ttctgtaaca aatgagcact tgatagacat ggaccatgaa 900 gccagttttt tcggggcctt tttagttggc taactgacct ggaaagaaaa agcaataacc 960 tcaaagtgac tattcagttt tcaggatgat acactatgaa gatgtttcaa aaaatctgac 1020 caaaacaaac aaacagaaaa cagaaaacaa aaaaacctct atgcaatctg agtagagcag 1080 ccacaaccaa aaaattctac aacacacact gttctgaaag tgactcactt atcccaagaa 1140 aatgaaattg ctgaaagatc tttcaggact ctacctcata tcagtttgct agcagaaatc 1200 tagaagactg tcagcttcca aacattaatg caatggttaa catcttctgt ctttataatc 1260 tactccttgt aaagactgta gaagaaagcg caacaatcca tctctcaagt agtgtatcac 1320 agtagtagcc tccaggtttc cttaagggac aacatcctta agtcaaaaga gagaagaggc 1380 accactaaaa gatcgcagtt tgcctggtgc agtggctcac acctgtaatc ccaacatttt 1440 gggaacccaa ggtgggtaga tcacgagatc aagagatcaa gaccatagtg accaacatag 1500 tgaaacccca tctctactga aagtgcaaaa attagctggg tgtgttggca catgcctgta 1560 gtcccagcta cttgagaggc tgaggcagga gaatcgtttg aacccgggag gcagaggttg 1620 cagtgtggtg agatcatgcc actacactcc agcctggcga cagagcgaga cttggtttca 1680 aaaaaaaaaa aaaaaaaaaa cttcagtaag tacgtgttat ttttttcaat aaaattctat 1740 tacagtatgt caaaaaaaaa aaaaaaaaa 1769 <210> SEQ ID NO 3 <211> LENGTH: 1803 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/X67878.1 <309> DATABASE ENTRY DATE: 1997-06-06 <313> RELEVANT RESIDUES: (1)..(1803) <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/NM_000074.1 <309> DATABASE ENTRY DATE: 2002-04-10 <313> RELEVANT RESIDUES: (1)..(1803) <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: NCBI/L07414.1 <309> DATABASE ENTRY DATE: 1993-04-27 <313> RELEVANT RESIDUES: (1)..(1803) <400> SEQUENCE: 3 tgccaccttc tctgccagaa gataccattt caactttaac acagcatgat cgaaacatac 60 aaccaaactt ctccccgatc tgcggccact ggactgccca tcagcatgaa aatttttatg 120 tatttactta ctgtttttct tatcacccag atgattgggt cagcactttt tgctgtgtat 180 cttcatagaa ggttggacaa gatagaagat gaaaggaatc ttcatgaaga ttttgtattc 240 atgaaaacga tacagagatg caacacagga gaaagatcct tatccttact gaactgtgag 300 gagattaaaa gccagtttga aggctttgtg aaggatataa tgttaaacaa agaggagacg 360 aagaaagaaa acagctttga aatgcaaaaa ggtgatcaga atcctcaaat tgcggcacat 420 gtcataagtg aggccagcag taaaacaaca tctgtgttac agtgggctga aaaaggatac 480 tacaccatga gcaacaactt ggtaaccctg gaaaatggga aacagctgac cgttaaaaga 540 caaggactct attatatcta tgcccaagtc accttctgtt ccaatcggga agcttcgagt 600 caagctccat ttatagccag cctctgccta aagtcccccg gtagattcga gagaatctta 660 ctcagagctg caaataccca cagttccgcc aaaccttgcg ggcaacaatc cattcacttg 720 ggaggagtat ttgaattgca accaggtgct tcggtgtttg tcaatgtgac tgatccaagc 780 caagtgagcc atggcactgg cttcacgtcc tttggcttac tcaaactctg aacagtgtca 840 ccttgcaggc tgtggtggag ctgacgctgg gagtcttcat aatacagcac agcggttaag 900 cccaccccct gttaactgcc tatttataac cctaggatcc tccttatgga gaactattta 960 ttatacactc caaggcatgt agaactgtaa taagtgaatt acaggtcaca tgaaaccaaa 1020 acgggccctg ctccataaga gcttatatat ctgaagcagc aaccccactg atgcagacat 1080 ccagagagtc ctatgaaaag acaaggccat tatgcacagg ttgaattctg agtaaacagc 1140 agataacttg ccaagttcag ttttgtttct ttgcgtgcag tgtctttcca tggataatgc 1200 atttgattta tcagtgaaga tgcagaaggg aaatggggag cctcagctca cattcagtta 1260 tggttgactc tgggttccta tggccttgtt ggagggggcc aggctctaga acgtctaaca 1320 cagtggagaa ccgaaacccc cccccccccc ccgccaccct ctcggacagt tattcattct 1380 ctttcaatct ctctctctcc atctctctct ttcagtctct ctctctcaac ctctttcttc 1440 caatctctct ttctcaatct ctctgtttcc ctttgtcagt ctcttccctc ccccagtctc 1500 tcttctcaat ccccctttct aacacacaca cacacacaca cacacacaca cacacacaca 1560 cacacacaca cagagtcagg ccgttgctag tcagttctct tctttccacc ctgtccctat 1620 ctctaccact atagatgagg gtgaggagta gggagtgcag ccctgagcct gcccactcct 1680 cattacgaaa tgactgtatt taaaggaaat ctattgtatc tacctgcagt ctccattgtt 1740 tccagagtga acttgtaatt atcttgttat ttattttttg aataataaag acctcttaac 1800 att 1803 <210> SEQ ID NO 4 <211> LENGTH: 1643 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 4 gcagaggacc agctaagagg gagagaagca actacagacc ccccctgaaa acaaccctca 60 gacgccacat cccctgacaa gctgccaggc aggttctctt cctctcacat actgacccac 120 ggctccaccc tctctcccct ggaaaggaca ccatgagcac tgaaagcatg atccgggacg 180 tggagctggc cgaggaggcg ctccccaaga agacaggggg gccccagggc tccaggcggt 240 gcttgttcct cagcctcttc tccttcctga tcgtggcagg cgccaccacg ctcttctgcc 300 tgctgcactt tggagtgatc ggcccccaga gggaagagtt ccccagggac ctctctctaa 360 tcagccctct ggcccaggca gtcagatcat cttctcgaac cccgagtgac aagcctgtag 420 cccatgttgt agcaaaccct caagctgagg ggcagctcca gtggctgaac cgccgggcca 480 atgccctcct ggccaatggc gtggagctga gagataacca gctggtggtg ccatcagagg 540 gcctgtacct catctactcc caggtcctct tcaagggcca aggctgcccc tccacccatg 600 tgctcctcac ccacaccatc agccgcatcg ccgtctccta ccagaccaag gtcaacctcc 660 tctctgccat caagagcccc tgccagaggg agaccccaga gggggctgag gccaagccct 720 ggtatgagcc catctatctg ggaggggtct tccagctgga gaagggtgac cgactcagcg 780 ctgagatcaa tcggcccgac tatctcgact ttgccgagtc tgggcaggtc tactttggga 840 tcattgccct gtgaggagga cgaacatcca accttcccaa acgcctcccc tgccccaatc 900 cctttattac cccctccttc agacaccctc aacctcttct ggctcaaaaa gagaattggg 960 ggcttagggt cggaacccaa gcttagaact ttaagcaaca agaccaccac ttcgaaacct 1020 gggattcagg aatgtgtggc ctgcacagtg aattgctggc aaccactaag aattcaaact 1080 ggggcctcca gaactcactg gggcctacag ctttgatccc tgacatctgg aatctggaga 1140 ccagggagcc tttggttctg gccagaatgc tgcaggactt gagaagacct cacctagaaa 1200 ttgacacaag tggaccttag gccttcctct ctccagatgt ttccagactt ccttgagaca 1260 cggagcccag ccctccccat ggagccagct ccctctattt atgtttgcac ttgtgattat 1320 ttattattta tttattattt atttatttac agatgaatgt atttatttgg gagaccgggg 1380 tatcctgggg gacccaatgt aggagctgcc ttggctcaga catgttttcc gtgaaaacgg 1440 agctgaacaa taggctgttc ccatgtagcc ccctggcctc tgtgccttct tttgattatg 1500 ttttttaaaa tatttatctg attaagttgt ctaaacaatg ctgatttggt gaccaactgt 1560 cactcattgc tgagcctctg ctccccaggg gagttgtgtc tgtaatcgcc ctactattca 1620 gtggcgagaa ataaagtttg ctt 1643 <210> SEQ ID NO 5 <211> LENGTH: 1325 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 5 gaggtttatt gggcctcggt cctcctgcac ctgctgcctg gatccccggc ctgcctgggc 60 ctgggccttg gttctcccca tgacaccacc tgaacgtctc ttcctcccaa gggtgtgtgg 120 caccacccta cacctcctcc ttctggggct gctgctggtt ctgctgcctg gggcccaggg 180 gctccctggt gttggcctca caccttcagc tgcccagact gcccgtcagc accccaagat 240 gcatcttgcc cacagcaccc tcaaacctgc tgctcacctc attggagacc ccagcaagca 300 gaactcactg ctctggagag caaacacgga ccgtgccttc ctccaggatg gtttctcctt 360 gagcaacaat tctctcctgg tccccaccag tggcatctac ttcgtctact cccaggtggt 420 cttctctggg aaagcctact ctcccaaggc cacctcctcc ccactctacc tggcccatga 480 ggtccagctc ttctcctccc agtacccctt ccatgtgcct ctcctcagct cccagaagat 540 ggtgtatcca gggctgcagg aaccctggct gcactcgatg taccacgggg ctgcgttcca 600 gctcacccag ggagaccagc tatccaccca cacagatggc atcccccacc tagtcctcag 660 ccctagtact gtcttctttg gagccttcgc tctgtagaac ttggaaaaat ccagaaagaa 720 aaaataattg atttcaagac cttctcccca ttctgcctcc attctgacca tttcaggggt 780 cgtcaccacc tctcctttgg ccattccaac agctcaagtc ttccctgatc aagtcaccgg 840 agctttcaaa gaaggaattc taggcatccc aggggaccca cactccctga accatccctg 900 atgtctgtct ggctgaggat ttcaagcctg cctaggaatt cccagcccaa agctgttggt 960 cttgtccacc agctaggtgg ggcctagatc cacacacaga ggaagagcag gcacatggag 1020 gagcttgggg gatgactaga ggcagggagg ggactattta tgaaggcaaa aaaattaaat 1080 tatttattta tggaggatgg agagagggaa taatagaaga acatccaagg agaaacagag 1140 acaggcccaa gagatgaaga gtgagagggc atgcgcacaa ggctgaccaa gagagaaaga 1200 agtaggcatg agggatcaca gggccccaga aggcagggaa aggctctgaa agccagctgc 1260 cgaccagagc cccacacgga ggcatctgca ccctcgatga agcccaataa acctcttttc 1320 tctga 1325 <210> SEQ ID NO 6 <211> LENGTH: 5307 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (4242)..(4242) <223> OTHER INFORMATION: n = a, t, c or g <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (4471)..(4471) <223> OTHER INFORMATION: n = a, t, c or g <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (4523)..(4523) <223> OTHER INFORMATION: n = a, t, c or g <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (4529)..(4529) <223> OTHER INFORMATION: n = a, t, c or g <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (4531)..(4531) <223> OTHER INFORMATION: n = a, t, c or g <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (4545)..(4545) <223> OTHER INFORMATION: n = a, t, c or g <400> SEQUENCE: 6 attccctcgg cgggccgagc ctcccctctc tcccgcccct cctcctccct ttcccacccc 60 tcggagtaga gctgcacatg cggctgctcc ctgctccgtc ccgcccagcc actgtcgcgc 120 aggaacgggt ccctgcagcc cccagccgat ggcaggacag tagccgcctg tcagaggtcg 180 tgaacggctg aggcagacgc agcggctccc gggcctcaag agagtggatg tctccggagg 240 ccatgggcta cccggaggtg gagcgcaggg aactcctgcc tgcagcagcg ccgcgggagc 300 gagggagcca gggctgcggg tgtggcgggg cccctgcccg ggcgggcgaa gggaacagct 360 gcctgctctt cctgggtttc tttggcctct cgctggccct ccacctgctg acgttgtgct 420 gctacctaga gttgcgctcg gagttgcggc gggaacgtgg agccgagtcc cgccttggcg 480 gctcgggcac ccctggcacc tctggcaccc taagcagcct cggtggcctc gaccctgaca 540 gccccatcac cagtcacctt gggcagccgt cacctaagca gcagccattg gaaccgggag 600 aagccgcact ccactctgac tcccaggacg ggcaccagat ggccctattg aatttcttct 660 tccctgatga aaagccatac tctgaagaag aaagtaggcg tgttcgccgc aataaaagaa 720 gcaaaagcaa tgaaggagca gatggcccag ttaaaaacaa gaaaaaggga aagaaagcag 780 gacctcctgg acccaatggc cctccaggac ccccaggacc tccaggaccc cagggacccc 840 caggaattcc agggattcct ggaattccag gaacaactgt tatgggacca cctggtcctc 900 caggtcctcc tggtcctcaa ggaccccctg gcctccaggg accttctggt gctgctgata 960 aagctggaac tcgagaaaac cagccagctg tggtgcatct acagggccaa gggtcagcaa 1020 ttcaagtcaa gaatgatctt tcaggtggag tgctcaatga ctggtctcgc atcactatga 1080 accccaaggt gtttaagcta catccccgca gcggggagct ggaggtactg gtggacggca 1140 cctacttcat ctatagtcag gtagaagtat actacatcaa cttcactgac tttgccagct 1200 atgaggtggt ggtggatgag aagcccttcc tgcagtgcac acgcagcatc gagacgggca 1260 agaccaacta caacacttgc tataccgcag gcgtctgcct cctcaaggcc cggcagaaga 1320 tcgccgtcaa gatggtgcac gctgacatct ccatcaacat gagcaagcac accacgttct 1380 ttggggccat caggctgggt gaagcccctg catcctagat tccccccatt ttgcctctgt 1440 ccgtgcccct tccctgggtt tgggagccag gactcccaga acctctaagt gctgctgtgg 1500 agtgaggtgt attggtgttg cagccgcaga gaaatgcccc agtgttattt attccccagt 1560 gactccaggg tgacaaggcc tgcttgactt tccagaatga ccttgagtta acaggacagt 1620 tgatggagcc ccagggttta catgaagcag aaccttcttt ggttccatgt tgactgactt 1680 atggcatgac tcttcaaccc cgaggtccct gttgtcagat ctattgtttg ttgcactaaa 1740 atgaggatcc agggcagcag gccagagaaa gcaaaggtgc actccagact ctgggggtgg 1800 acatctgacc ccaagggggc tgctgctcct ctcttgggta gggtagtggc tggggtggag 1860 tgggaagkga gcattgcagc ctaagaagaa ggccagagag ggaaaaggca ggtgcttttg 1920 gcagagacca taagagaaac ctgccaagga gcatccttgg cagtgggaat gttctttctg 1980 ctctatactg tggcctgcag gagggttgga gtgctcttcc cactccagct gacagccaca 2040 ccgtggcagc ttgctgggct ttgggaagtt tgctgtgctt tggaacaatc acagggaatg 2100 gccacaaacc tgcccgccta agaccctgaa tccgtacttg ggtcacatga ctctcatttt 2160 atttacagct gtgctccaca ctcagaaaat tccctggggt caccttctag ttgcccccat 2220 tcccagcctg actagaactc ctgtcttctt tctccatgga gcctacctct gtctgagaca 2280 ggtgcctaac ctgggacctg tggtcatgtg agtctgggat attctttagc ttacctgggc 2340 acagacagaa ttttccattt attaagcagt acagatgttt ttcatccatt cctaatcaaa 2400 ttctgtctgg ggacgaaggg ttggacggga tgacctccag aagtcccttc aatttctagt 2460 acctgtgact cttagccctc accacagcct tctaaattcc caaatcctag actgctcctg 2520 ggcattagca aggcagagcc tttttacctg gcctagaaag ggcaaggggt gaggatagga 2580 cagagggatt ttgttcaagt ttgctgcaac ccaagtggac gttaggccag gccttatctg 2640 aaaggccagc agctgatgct gtactaaccc agtctttctt cactctggct tcaaaaagcc 2700 acagcagagc attgtcaccg caggtcccca tgctgctccc ctaaagccag gctcaggaga 2760 agccagtgtc taggcactga gcagggatct gccccctagt tcaggtccaa attcaccttc 2820 ccctaaaccc caagcttccc aacagatcat atggtaggac cctcgagagc cttacttcaa 2880 agtgcctggg ctcagcctgg tttctgggtg ctagatccag cccaaacctg ggaaggccag 2940 ccttgtacag tctgctcctc ttgttcctga aatgtgtttc cttttcagga gatggggaat 3000 aatttccttc aggcagctga aattcaccaa gaacagcggg tacttatttc tcaagctgtg 3060 ccttcccttt ctaagcaacc acactgcttg gcccttcaag ggtcagggtg agacgtgatg 3120 ggctaggcct ccgttgtctg gttgctaatg acagccttgc aacccaaggt gaggtgaact 3180 ccaggcatgt gtctggccct aactcctata aagtgcctcg gacagtccgc agttgtagca 3240 gaaaccaaca agaaccactc cttcatgttt ggaaaataat ttctcttgta ttatctcctt 3300 tgaagaaggc aaggctgata atatgacaaa catcattgtt tagatgaggc tcagagaggt 3360 agcactctca gagtgttttg accagtttaa gccgcagacc tggagcttca gccaggtctg 3420 actccaaagc tgttccatta caccacagca ttgtgtggaa tttgaggtct agagagaacc 3480 aataaaagtg gtaattggga actgaaatcc ttgagagttc cggggagaaa cccagagatg 3540 cctgatttca ttcctcgatg gtaatacccg tcctctcggc tgccaggggc tctgtggcaa 3600 aaagagtcag acatttcttt ggaaaacagc gaacagcctt agagctcttg tgttcagaag 3660 aatcttcctg gcacaatgtt ggagcagcag gcctctggga cccacagaac ttgtggcctt 3720 tatgttcttt cacccatcct aggaaccagc caaccatcat gtgtagagcc cctactgtgg 3780 gcaaagtcct cctttcatta ccctacagac agcttacagg agccagcctg cttcccacaa 3840 ctactagtgt gactccttat ctctttccac cataccttag agactttgat actaccaggg 3900 tctctcaggg atggagggaa gacctgaaag agaggactgg ttctgaggcc agaaaggtgt 3960 gaggagagag gaggaaaagt cttcctaatt gtgcccctaa agagcatcct gataccattc 4020 tattctccag acatggaggg gatgataaag gaaataggat ctccactgga cccttgattc 4080 attctgaacc ctccaaagga actctaagag ggcgagggat gatgagggaa gcaataggta 4140 gctggggagc cctattgctg ctaagtcatt ggcaaagtgc aaagcaattt actgatgaga 4200 gaatgtggaa atagatgtgc agtttggaat tatgttggtg tnaatttgcc agaggaccaa 4260 tgcttgcatg gagaatggac gaggacattt gtgggcaagc agatgacaga ggtttgaagg 4320 agaatggcat ggcaggagtc tctgccagtt acttgggctt caacagccaa gctggcacaa 4380 aagacagctg gcggaggctg ctcggctact ggttacctgg agaagtagta tttgcctatt 4440 tcccccttca tccatcctga gccaaatttc ntttgctgaa caggaaagag cyaggaaccc 4500 tggaggtaaa caaagacttt gancctgtnt nagtgtatgt gtttntgtaa cttcctgtgg 4560 agtgcaaata gattcagaga aatttagagc taaaaaggcc cttagaggga atctagccca 4620 acctacattc caccctgtta cttatgtaga aactgaggcc cagagaggga agatgacctg 4680 ccccaagtgg tgagcaagca ccaacctcca gactcagcag agtgaggggg taaagcagtt 4740 cctgtcccac atggccatct tctttcttcc acccacaaac tccaggctgg aagtacttgg 4800 cccccttcag gagcctggcc aggcagggag agagtagctg cagccttcat cagaactctt 4860 cctcctccca aggcattctc ccagctctag cctctggact ggaaagcaca agactggccc 4920 agtgccagca agtccttagg ctactgtaat gctgcctcag gacccatccc tgcctggagg 4980 ctcctctagg ccctgtgagc acaaagaaga aagctgattt ttgtctttta atccatttca 5040 ggactctctc caggagggct cggggtgtgt catttctata ttcctccagc tgggattggg 5100 gggtgggctt tgttgtgaga atggcctgga gcaggcccaa tgctgctttt gggggtcagc 5160 atccagtgtg agatactgtg tatataaact atatataatg tatataaact gggatgtaag 5220 tttgtgtaaa ttaatgtttt attctttgca aataaaacgc tttccccgtc aaaaaaaaaa 5280 aaaaaaaaaa aaaaaaaaaa aaaaaaa 5307 <210> SEQ ID NO 7 <211> LENGTH: 972 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 7 tctagactca ggactgagaa gaagtaaaac cgtttgctgg ggctggcctg actcaccagc 60 tgccatgcag cagcccttca attacccata tccccagatc tactgggtgg acagcagtgc 120 cagctctccc tgggcccctc caggcacagt tcttccctgt ccaacctctg tgcccagaag 180 gcctggtcaa aggaggccac caccaccacc gccaccgcca ccactaccac ctccgccgcc 240 gccgccacca ctgcctccac taccgctgcc acccctgaag aagagaggga accacagcac 300 aggcctgtgt ctccttgtga tgtttttcat ggttctggtt gccttggtag gattgggcct 360 ggggatgttt cagctcttcc acctacagaa ggagctggca gaactccgag agtctaccag 420 ccagatgcac acagcatcat ctttggagaa gcaaataggc caccccagtc caccccctga 480 aaaaaaggag ctgaggaaag tggcccattt aacaggcaag tccaactcaa ggtccatgcc 540 tctggaatgg gaagacacct atggaattgt cctgctttct ggagtgaagt ataagaaggg 600 tggccttgtg atcaatgaaa ctgggctgta ctttgtatat tccaaagtat acttccgggg 660 tcaatcttgc aacaacctgc ccctgagcca caaggtctac atgaggaact ctaagtatcc 720 ccaggatctg gtgatgatgg aggggaagat gatgagctac tgcactactg ggcagatgtg 780 ggcccgcagc agctacctgg gggcagtgtt caatcttacc agtgctgatc atttatatgt 840 caacgtatct gagctctctc tggtcaattt tgaggaatct cagacgtttt tcggcttata 900 taagctctaa gagaagcact ttgggattct ttccattatg attctttgtt acaggcaccg 960 agatgttcta ga 972 <210> SEQ ID NO 8 <211> LENGTH: 3362 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 8 ccatatcttc atcttccctc tacccagatt gtgaagatgg aaagggtcca acccctggaa 60 gagaatgtgg gaaatgcagc caggccaaga ttcgagagga acaagctatt gctggtggcc 120 tctgtaattc agggactggg gctgctcctg tgcttcacct acatctgcct gcacttctct 180 gctcttcagg tatcacatcg gtatcctcga attcaaagta tcaaagtaca atttaccgaa 240 tataagaagg agaaaggttt catcctcact tcccaaaagg aggatgaaat catgaaggtg 300 cagaacaact cagtcatcat caactgtgat gggttttatc tcatctccct gaagggctac 360 ttctcccagg aagtcaacat tagccttcat taccagaagg atgaggagcc cctcttccaa 420 ctgaagaagg tcaggtctgt caactccttg atggtggcct ctctgactta caaagacaaa 480 gtctacttga atgtgaccac tgacaatacc tccctggatg acttccatgt gaatggcgga 540 gaactgattc ttatccatca aaatcctggt gaattctgtg tcctttgagg ggctgatggc 600 aatatctaaa accaggcacc agcatgaaca ccaagctggg ggtggacagg gcatggattc 660 ttcattgcaa gtgaaggagc ctcccagctc agccacgtgg gatgtgacaa gaagcagatc 720 ctggccctcc cgcccccacc cctcagggat atttaaaact tattttatat accagttaat 780 cttatttatc cttatatttt ctaaattgcc tagccgtcac accccaagat tgccttgagc 840 ctactaggca cctttgtgag aaagaaaaaa tagatgcctc ttcttcaaga tgcattgttt 900 ctattggtca ggcaattgtc ataataaact tatgtcattg aaaacggtac ctgactacca 960 tttgctggaa atttgacatg tgtgtggcat tatcaaaatg aagaggagca aggagtgaag 1020 gagtggggtt atgaatctgc caaaggtggt atgaaccaac ccctggaagc caaagcggcc 1080 tctccaaggt taaattgatt gcagtttgca tattgcctaa atttaaactt tctcatttgg 1140 tgggggttca aaagaagaat cagcttgtga aaaatcagga cttgaagaga gccgtctaag 1200 aaataccacg tgcttttttt ctttaccatt ttgctttccc agcctccaaa catagttaat 1260 agaaatttcc cttcaaagaa ctgtctgggg atgtgatgct ttgaaaaatc taatcagtga 1320 cttaagagag attttcttgt atacagggag agtgagataa cttattgtga agggttagct 1380 ttactgtaca ggatagcagg gaactggaca tctcagggta aaagtcagta cggattttaa 1440 tagcctgggg aggaaaacac attctttgcc acagacaggc aaagcaacac atgctcatcc 1500 tcctgcctat gctgagatac gcactcagct ccatgtcttg tacacacaga aacattgctg 1560 gtttcaagaa atgaggtgat cctattatca aattcaatct gatgtcaaat agcactaaga 1620 agttattgtg ccttatgaaa aataatgatc tctgtctaga aataccatag accatatata 1680 gtctcacatt gataattgaa actagaaggg tctatatcag cctatgccag ggcttcaatg 1740 gaatagtatc cccttatgtt tagttgaaat gtccccttaa cttgatataa tgtgttatgc 1800 ttatggcgct gtgacaatct gatttttcat gtcaacttcc agatgatttg taacttctct 1860 gtgccaaacc ttttataaac ataaattttt gagatatgta ttttaaaatt gtagcacatg 1920 tttccctgac attttcaata gaggatacaa catcacagaa tctttctgga tgattctgtg 1980 ttatcaagga attgtactgt gctacaatta tctctagaat ctccagaaag gtggagggct 2040 gttcgccctt acactaaatg gtctcagttg gatttttttt tcctgttttc tatttcctct 2100 taagtacacc ttcaactata ttcccatccc tctattttaa tctgttatga aggaaggtaa 2160 ataaaaatgc taaatagaag aaattgtagg taaggtaaga ggaatcaagt tctgagtggc 2220 tgccaaggca ctcacagaat cataatcatg gctaaatatt tatggagggc ctactgtgga 2280 ccaggcactg gctaaatact tacatttaca agaatcattc tgagacagat attcaatgat 2340 atctggcttc actactcaga agattgtgtg tgtgtttgtg tgtgtgtgtg tgtgtgtatt 2400 tcactttttg ttattgacca tgttctgcaa aattgcagtt actcagtgag tgatatccga 2460 aaaagtaaac gtttatgact ataggtaata tttaagaaaa tgcatggttc atttttaagt 2520 ttggaatttt tatctatatt tctcacagat gtgcagtgca catgcaggcc taagtatatg 2580 ttgtgtgtgt ttgtctttga cgtcatggtc ccctctctta ggtgctcact cgctttgggt 2640 gcacctggcc tgctcttccc atgttggcct ctgcaaccac acagggatat ttctgctatg 2700 caccagcctc actccacctt ccttccatca aaaatatgtg tgtgtgtctc agtccctgta 2760 agtcatgtcc ttcacaggga gaattaaccc ttcgatatac atggcagagt tttgtgggaa 2820 aagaattgaa tgaaaagtca ggagatcaga attttaaatt tgacttagcc actaactagc 2880 catgtaacct tgggaaagtc atttcccatt tctgggtctt gcttttcttt ctgttaaatg 2940 agaggaatgt taaatatcta acagtttaga atcttatgct tacagtgtta tctgtgaatg 3000 cacatattaa atgtctatgt tcttgttgct atgagtcaag gagtgtacac ttctccttta 3060 ctatgttgaa tgtatttttt tctggacaag cttacatctt cctcagccat ctttgtgagt 3120 ccttcaagag cagttatcaa ttgttagtta gatattttct atttagagaa tgcttaaggg 3180 attccaatcc cgatccaaat cataatttgt tcttaagtat actgggcagg tcccctattt 3240 taagtcataa ttttgtattt agtgctttcc tggctctcag agagtattaa tattgatatt 3300 aataatatag ttaatagtaa tattgctatt tacatggaaa caaataaaag atctcagaat 3360 tc 3362 <210> SEQ ID NO 9 <211> LENGTH: 534 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 9 atgtgtttga gccacttgga aaatatgcct ttaagccatt caagaactca aggagctcag 60 agatcatcct ggaagctgtg gctcttttgc tcaatagtta tgttgctatt tctttgctcc 120 ttcagttggc taatctttat ttttctccaa ttagagactg ctaaggagcc ctgtatggct 180 aagtttggac cattaccctc aaaatggcaa atggcatctt ctgaacctcc ttgcgtgaat 240 aaggtgtctg actggaagct ggagatactt cagaatggct tatatttaat ttatggccaa 300 gtggctccca atgcaaacta caatgatgta gctccttttg aggtgcggct gtataaaaac 360 aaagacatga tacaaactct aacaaacaaa tctaaaatcc aaaatgtagg agggacttat 420 gaattgcatg ttggggacac catagacttg atattcaact ctgagcatca ggttctaaaa 480 aataatacat actggggtat cattttacta gcaaatcccc aattcatctc ctag 534 <210> SEQ ID NO 10 <211> LENGTH: 1906 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 10 ccaagtcaca tgattcagga ttcaggggga gaatccttct tggaacagag atgggcccag 60 aactgaatca gatgaagaga gataaggtgt gatgtgggga agactatata aagaatggac 120 ccagggctgc agcaagcact caacggaatg gcccctcctg gagacacagc catgcatgtg 180 ccggcgggct ccgtggccag ccacctgggg accacgagcc gcagctattt ctatttgacc 240 acagccactc tggctctgtg ccttgtcttc acggtggcca ctattatggt gttggtcgtt 300 cagaggacgg actccattcc caactcacct gacaacgtcc ccctcaaagg aggaaattgc 360 tcagaagacc tcttatgtat cctgaaaaga gctccattca agaagtcatg ggcctacctc 420 caagtggcaa agcatctaaa caaaaccaag ttgtcttgga acaaagatgg cattctccat 480 ggagtcagat atcaggatgg gaatctggtg atccaattcc ctggtttgta cttcatcatt 540 tgccaactgc agtttcttgt acaatgccca aataattctg tcgatctgaa gttggagctt 600 ctcatcaaca agcatatcaa aaaacaggcc ctggtgacag tgtgtgagtc tggaatgcaa 660 acgaaacacg tataccagaa tctctctcaa ttcttgctgg attacctgca ggtcaacacc 720 accatatcag tcaatgtgga tacattccag tacatagata caagcacctt tcctcttgag 780 aatgtgttgt ccatcttctt atacagtaat tcagactgaa cagtttctct tggccttcag 840 gaagaaagcg cctctctacc atacagtatt tcatccctcc aaacacttgg gcaaaaagaa 900 aactttagac caagacaaac tacacagggt attaaatagt atacttctcc ttctgtctct 960 tggaaagata cagctccagg gttaaaaaga gagtttttag tgaagtatct ttcagatagc 1020 aggcagggaa gcaatgtagt gtggtgggca gagccccaca cagaatcaga agggatgaat 1080 ggatgtccca gcccaaccac taattcactg tatggtcttg atctatttct tctgttttga 1140 gagcctccag ttaaaatggg gcttcagtac cagagcagct agcaactctg ccctaatggg 1200 aaatgaaggg gagctgggtg tgagtgttta cactgtgccc ttcacgggat acttctttta 1260 tctgcagatg gcctaatgct tagttgtcca agtcgcgatc aaggactctc tcacacagga 1320 aacttcccta tactggcaga tacacttgtg actgaaccat gcccagttta tgcctgtctg 1380 actgtcactc tggcactagg aggctgatct tgtactccat atgaccccac ccctaggaac 1440 ccccagggaa aaccaggctc ggacagcccc ctgttcctga gatggaaagc acaaatttaa 1500 tacaccacca caatggaaaa caagttcaaa gacttttact tacagatcct ggacagaaag 1560 ggcataatga gtctgaaggg cagtcctcct tctccaggtt acatgaggca ggaataagaa 1620 gtcagacaga gacagcaaga cagttaacaa cgtaggtaaa gaaatagggt gtggtcactc 1680 tcaattcact ggcaaatgcc tgaatggtct gtctgaagga agcaacagag aagtggggaa 1740 tccagtctgc taggcaggaa agatgcctct aagttcttgt ctctggccag aggtgtggta 1800 tagaaccaga aacccatatc aagggtgact aagcccggct tccggtatga gaaattaaac 1860 ttgtatacaa aatggttgcc aaggcaacat aaaattataa gaattc 1906 <210> SEQ ID NO 11 <211> LENGTH: 2785 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (49)..(49) <223> OTHER INFORMATION: n = a, t, c or g <400> SEQUENCE: 11 agtgcagtat ctcatggagg tgtttggatg tctcttcctg tggggggtnc caaagcccat 60 gtctcttggc attttctttc agattctatc agccctctct ctttctctcc tgtctctctc 120 tttcattcat acactgagtc attcagagat ggcttctctc caactcggag ctgcaagtaa 180 ttctggatct ggtcacacac acaaagtccc cagagttgcc aatttatcta gttcatctgt 240 gcctgttcaa gatgatgtaa ctaaacattt accttcaggg aggtgtttcc aaagaatttt 300 catcgatata tagaaatcaa gagaaaatcc atactatcac caaatcaaga gaaattccat 360 actatcacca gttggccaac tttccaagtc tagtgcagaa atccaaggca cctcacacct 420 agagttccta tacctctgag actccagagg aaagaacaag acagtgcaga aggatatgtt 480 agaacccact gaaaacctag aaggttaaaa aggaagcata ccctcctgac ctataagaaa 540 attttcagtc tgcaggggga tatccttgtg gcccaagaca ttggtgttat catttgacta 600 agaggaaatt atttgtggtg agctctgagt gaggattagg accagggaga tgccaagttt 660 ctatcactta cctcatgcct gtaagacaag tgttttgttc caattgatga atggggataa 720 aacagttcag ccaatcactt atggggcaaa gaatgggaat ttgaagggtc tggtgcctgg 780 ccttgtcata cgtaaacaag agaggcatcg atgagtttta tctgagtcat ttgggaaagg 840 ataattcttg cagcaagcca ttttcctaaa cacagaagaa tagggggatt ccttaacctt 900 cattgttctc caggatcata ggtctcaggt aaaattaaaa attttcaggt cagaccactc 960 agtctcagaa aggcaaagta atttgcccca ggtcactagt ccaagatgtt attctctttg 1020 aacaaatgtg tatgtccagt cacatattct tcattcattc ctccccaaag cagtttttag 1080 ctgttaggta tattcgatca ctttagtcta ttttgaaaat gatatgagac gctttttaag 1140 caaagtctac agtttcccaa tgagaaaatt aatcctcttt cttgtctttc cagttgtgag 1200 acaaactccc acacagcact ttaaaaatca gttcccagct ctgcactggg aacatgaact 1260 aggcctggcc ttcaccaaga accgaatgaa ctataccaac aaattcctgc tgatcccaga 1320 gtcgggagac tacttcattt actcccaggt cacattccgt gggatgacct ctgagtgcag 1380 tgaaatcaga caagcaggcc gaccaaacaa gccagactcc atcactgtgg tcatcaccaa 1440 ggtaacagac agctaccctg agccaaccca gctcctcatg gggaccaagt ctgtatgcga 1500 agtaggtagc aactggttcc agcccatcta cctcggagcc atgttctcct tgcaagaagg 1560 ggacaagcta atggtgaacg tcagtgacat ctctttggtg gattacacaa aagaagataa 1620 aaccttcttt ggagccttct tactatagga ggagagcaaa tatcattata tgaaagtcct 1680 ctgccaccga gttcctaatt ttctttgttc aaatgtaatt ataaccaggg gttttcttgg 1740 ggccgggagt agggggcatt ccacagggac aacggtttag ctatgaaatt tggggccaaa 1800 atttcacact tcatgtgcct tactgatgag agtactaact ggaaaaaggc tgaagagagc 1860 aaatatatta ttaagatggg ttggaggatt ggcgagtttc taaatattaa gacactgatc 1920 actaaatgaa tggatgatct actcgggtca ggattgaaag agaaatattt caacacctcc 1980 tgctatacaa tggtcaccag tggtccagtt attgttcaat ttgatcataa atttgcttca 2040 attcaggagc tttgaaggaa gtccaaggaa agctctagaa aacagtataa actttcagag 2100 gcaaaatcct tcaccaattt ttccacatac tttcatgcct tgcctaaaaa aaatgaaaag 2160 agagttggta tgtctcatga atgttcacac agaaggagtt ggttttcatg tcatctacag 2220 catatgagaa aagctacctt tcttttgatt atgtacacag atatctaaat aaggaagtat 2280 gagtttcaca tgtatatcaa aaatacaaca gttgcttgta ttcagtagag ttttcttgcc 2340 cacctatttt gtgctgggtt ctaccttaac ccagaagaca ctatgaaaaa caagacagac 2400 tccactcaaa atttatatga acaccactag atacttcctg atcaaacatc agtcaacata 2460 ctctaaagaa taactccaag tcttggccag gcgcagtggc tcacacctgt aatcccaaca 2520 ctttgggagg ccaaggtggg tggatcatct aaggccggga gttcaagacc agcctgacca 2580 acgtggagaa accccatctc tactaaaaat acaaaattag ccgggcgtgg tagcgcatgg 2640 ctgtaatcct ggctactcag gaggccgagg cagaagaatt gcttgaactg gggaggcaga 2700 ggttgcggtg agcccagatc gcgccattgc actccagcct gggtaacaag agcaaaactc 2760 tgtccaaaaa aaaaaaaaaa aaaaa 2785 <210> SEQ ID NO 12 <211> LENGTH: 1169 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 12 gaggttgaag gacccaggcg tgtcagccct gctccagaga ccttgggcat ggaggagagt 60 gtcgtacggc cctcagtgtt tgtggtggat ggacagaccg acatcccatt cacgaggctg 120 ggacgaagcc accggagaca gtcgtgcagt gtggcccggg tgggtctggg tctcttgctg 180 ttgctgatgg gggctgggct ggccgtccaa ggctggttcc tcctgcagct gcactggcgt 240 ctaggagaga tggtcacccg cctgcctgac ggacctgcag gctcctggga gcagctgata 300 caagagcgaa ggtctcacga ggtcaaccca gcagcgcatc tcacaggggc caactccagc 360 ttgaccggca gcggggggcc gctgttatgg gagactcagc tgggcctggc cttcctgagg 420 ggcctcagct accacgatgg ggcccttgtg gtcaccaaag ctggctacta ctacatctac 480 tccaaggtgc agctgggcgg tgtgggctgc ccgctgggcc tggccagcac catcacccac 540 ggcctctaca agcgcacacc ccgctacccc gaggagctgg agctgttggt cagccagcag 600 tcaccctgcg gacgggccac cagcagctcc cgggtctggt gggacagcag cttcctgggt 660 ggtgtggtac acctggaggc tggggaggag gtggtcgtcc gtgtgctgga tgaacgcctg 720 gttcgactgc gtgatggtac ccggtcttac ttcggggctt tcatggtgtg aaggaaggag 780 cgtggtgcat tggacatggg tctgacacgt ggagaactca gagggtgcct caggggaaag 840 aaaactcacg aagcagaggc tgggcgtggt ggctctcgcc tgtaatccca gcactttggg 900 aggccaaggc aggcggatca cctgaggtca ggagttcgag accagcctgg ctaacatggc 960 aaaaccccat ctctactaaa aatacaaaaa ttagccggac gtggtggtgc ctgcctgtaa 1020 tccagctact caggaggctg aggcaggata attttgctta aacccgggag gcggaggttg 1080 cagtgagccg agatcacacc actgcactcc aacctgggaa acgcagtgag actgtgcctc 1140 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 1169 <210> SEQ ID NO 13 <211> LENGTH: 1619 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 13 gtcatggaat acgcctctga cgcttcactg gaccccgaag ccccgtggcc tcccgcgccc 60 cgcgctcgcg cctgccgcgt actgccttgg gccctggtcg cggggctgct gctgctgctg 120 ctgctcgctg ccgcctgcgc cgtcttcctc gcctgcccct gggccgtgtc cggggctcgc 180 gcctcgcccg gctccgcggc cagcccgaga ctccgcgagg gtcccgagct ttcgcccgac 240 gatcccgccg gcctcttgga cctgcggcag ggcatgtttg cgcagctggt ggcccaaaat 300 gttctgctga tcgatgggcc cctgagctgg tacagtgacc caggcctggc aggcgtgtcc 360 ctgacggggg gcctgagcta caaagaggac acgaaggagc tggtggtggc caaggctgga 420 gtctactatg tcttctttca actagagctg cggcgcgtgg tggccggcga gggctcaggc 480 tccgtttcac ttgcgctgca cctgcagcca ctgcgctctg ctgctggggc cgccgccctg 540 gctttgaccg tggacctgcc acccgcctcc tccgaggctc ggaactcggc cttcggtttc 600 cagggccgct tgctgcacct gagtgccggc cagcgcctgg gcgtccatct tcacactgag 660 gccagggcac gccatgcctg gcagcttacc cagggcgcca cagtcttggg actcttccgg 720 gtgacccccg aaatcccagc cggactccct tcaccgaggt cggaataacg cccagcctgg 780 gtgcagccca cctggacaga gtccgaatcc tactccatcc ttcatggaga cccctggtgc 840 tgggtccctg ctgctttctc tacctcaagg ggcttggcag gggtccctgc tgctgacctc 900 cccttgagga ccctcctcac ccactccttc cccaagttgg accttgatat ttattctgag 960 cctgagctca gataatatat tatatatatt atatatatat atatatttct atttaaagag 1020 gatcctgagt ttgtgaatgg acttttttag aggagttgtt ttgggggggg ggtcttcgac 1080 attgccgagg ctggtcttga actcctggac ttagacgatc ctcctgcctc agcctcccaa 1140 gcaactggga ttcatccttt ctattaattc attgtactta tttgcctatt tgtgtgtatt 1200 gagcatctgt aatgtgccag cattgtgccc aggctagggg gctatagaaa catctagaaa 1260 tagactgaaa gaaaatctga gttatggtaa tacgtgagga atttaaagac tcatccccag 1320 cctccacctc ctgtgtgata cttgggggct agcttttttc tttctttctt ttttttgaga 1380 tggtcttgtt ctgtcaacca ggctagaatg cagcggtgca atcatgagtc aatgcagcct 1440 ccagcctcga cctcccgagg ctcaggtgat cctcccatct cagcctctcg agtagctggg 1500 accacagttg tgtgccacca cacttggcta actttttaat ttttttgcgg agacggtatt 1560 gctatgttgc caaggttgtt tacatgccag tacaatttat aataaacact catttttcc 1619 <210> SEQ ID NO 14 <211> LENGTH: 926 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 14 ccagagaggg gcaggcttgt cccctgacag gttgaagcaa gtagacgccc aggagccccg 60 ggagggggct gcagtttcct tccttccttc tcggcagcgc tccgcgcccc catcgcccct 120 cctgcgctag cggaggtgat cgccgcggcg atgccggagg agggttcggg ctgctcggtg 180 cggcgcaggc cctatgggtg cgtcctgcgg gctgctttgg tcccattggt cgcgggcttg 240 gtgatctgcc tcgtggtgtg catccagcgc ttcgcacagg ctcagcagca gctgccgctc 300 gagtcacttg ggtgggacgt agctgagctg cagctgaatc acacaggacc tcagcaggac 360 cccaggctat actggcaggg gggcccagca ctgggccgct ccttcctgca tggaccagag 420 ctggacaagg ggcagctacg tatccatcgt gatggcatct acatggtaca catccaggtg 480 acgctggcca tctgctcctc cacgacggcc tccaggcacc accccaccac cctggccgtg 540 ggaatctgct ctcccgcctc ccgtagcatc agcctgctgc gtctcagctt ccaccaaggt 600 tgtaccattg tctcccagcg cctgacgccc ctggcccgag gggacacact ctgcaccaac 660 ctcactggga cacttttgcc ttcccgaaac actgatgaga ccttctttgg agtgcagtgg 720 gtgcgcccct gaccactgct gctgattagg gttttttaaa ttttatttta ttttatttaa 780 gttcaagaga aaaagtgtac acacaggggc cacccggggt tggggtggga gtgtggtggg 840 gggtagtttg tggcaggaca agagaaggca ttgagctttt tctttcattt tcctattaaa 900 aaatacaaaa atcaaaacaa aaaaaa 926 <210> SEQ ID NO 15 <211> LENGTH: 894 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 15 cagtctcaat gggggcactg gggctggagg gcaggggtgg gaggctccag gggaggggtt 60 ccctcctgct agctgtggca ggagccactt ctctggtgac cttgttgctg gcggtgccta 120 tcactgtcct ggctgtgctg gccttagtgc cccaggatca gggaggactg gtaacggaga 180 cggccgaccc cggggcacag gcccagcaag gactggggtt tcagaagctg ccagaggagg 240 agccagaaac agatctcagc cccgggctcc cagctgccca cctcataggc gctccgctga 300 aggggcaggg gctaggctgg gagacgacga aggaacaggc gtttctgacg agcgggacgc 360 agttctcgga cgccgagggg ctggcgctcc cgcaggacgg cctctattac ctctactgtc 420 tcgtcggcta ccggggccgg gcgccccctg gcggcgggga cccccagggc cgctcggtca 480 cgctgcgcag ctctctgtac cgggcggggg gcgcctacgg gccgggcact cccgagctgc 540 tgctcgaggg cgccgagacg gtgactccag tgctggaccc ggccaggaga caagggtacg 600 ggcctctctg gtacacgagc gtggggttcg gcggcctggt gcagctccgg aggggcgaga 660 gggtgtacgt caacatcagt caccccgata tggtggactt cgcgagaggg aagaccttct 720 ttggggccgt gatggtgggg tgagggaata tgagtgcgtg gtgcgagtgc gtgaatattg 780 ggggcccgga cgcccaggac cccatggcag tgggaaaaat gtaggagact gtttggaaat 840 tgattttgaa cctgatgaaa ataaagaatg gaaagcttca gtgctgccga taaa 894 <210> SEQ ID NO 16 <211> LENGTH: 1306 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 16 cacagccccc cgcccccatg gccgcccgtc ggagccagag gcggaggggg cgccgggggg 60 agccgggcac cgccctgctg gtcccgctcg cgctgggcct gggcctggcg ctggcctgcc 120 tcggcctcct gctggccgtg gtcagtttgg ggagccgggc atcgctgtcc gcccaggagc 180 ctgcccagga ggagctggtg gcagaggagg accaggaccc gtcggaactg aatccccaga 240 cagaagaaag ccaggatcct gcgcctttcc tgaaccgact agttcggcct cgcagaagtg 300 cacctaaagg ccggaaaaca cgggctcgaa gagcgatcgc agcccattat gaagttcatc 360 cacgacctgg acaggacgga gcgcaggcag gtgtggacgg gacagtgagt ggctgggagg 420 aagccagaat caacagctcc agccctctgc gctacaaccg ccagatcggg gagtttatag 480 tcacccgggc tgggctctac tacctgtact gtcaggtgca ctttgatgag gggaaggctg 540 tctacctgaa gctggacttg ctggtggatg gtgtgctggc cctgcgctgc ctggaggaat 600 tctcagccac tgcggccagt tccctcgggc cccagctccg cctctgccag gtgtctgggc 660 tgttggccct gcggccaggg tcctccctgc ggatccgcac cctcccctgg gcccatctca 720 aggctgcccc cttcctcacc tacttcggac tcttccaggt tcactgaggg gccctggtct 780 ccccacagtc gtcccaggct gccggctccc ctcgacagct ctctgggcac ccggtcccct 840 ctgccccacc ctcagccgct ctttgctcca gacctgcccc tccctctaga ggctgcctgg 900 gcctgttcac gtgttttcca tcccacataa atacagtatt cccactctta tcttacaact 960 cccccaccgc ccactctcca cctcactagc tccccaatcc ctgacccttt gaggccccca 1020 gtgatctcga ctcccccctg gccacagacc cccagggcat tgtgttcact gtactctgtg 1080 ggcaaggatg ggtccagaag accccacttc aggcactaag aggggctgga cctggcggca 1140 ggaagccaaa gagactgggc ctaggccagg agttcccaaa tgtgaggggc gagaaacaag 1200 acaagctcct cccttgagaa ttccctgtgg atttttaaaa cagatattat ttttattatt 1260 attgtgacaa aatgttgata aatggatatt aaatagaata agtcag 1306 <210> SEQ ID NO 17 <211> LENGTH: 1348 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 17 ggtacgaggc ttcctagagg gactggaacc taattctcct gaggctgagg gagggtggag 60 ggtctcaagg caacgctggc cccacgacgg agtgccagga gcactaacag tacccttagc 120 ttgctttcct cctccctcct ttttattttc aagttccttt ttatttctcc ttgcgtaaca 180 accttcttcc cttctgcacc actgcccgta cccttacccg ccccgccacc tccttgctac 240 cccactcttg aaaccacagc tgttggcagg gtccccagct catgccagcc tcatctcctt 300 tcttgctagc ccccaaaggg cctccaggca acatgggggg cccagtcaga gagccggcac 360 tctcagttgc cctctggttg agttgggggg cagctctggg ggccgtggct tgtgccatgg 420 ctctgctgac ccaacaaaca gagctgcaga gcctcaggag agaggtgagc cggctgcagg 480 ggacaggagg cccctcccag aatggggaag ggtatccctg gcagagtctc ccggagcaga 540 gttccgatgc cctggaagcc tgggagaatg gggagagatc ccggaaaagg agagcagtgc 600 tcacccaaaa acagaagaag cagcactctg tcctgcacct ggttcccatt aacgccacct 660 ccaaggatga ctccgatgtg acagaggtga tgtggcaacc agctcttagg cgtgggagag 720 gcctacaggc ccaaggatat ggtgtccgaa tccaggatgc tggagtttat ctgctgtata 780 gccaggtcct gtttcaagac gtgactttca ccatgggtca ggtggtgtct cgagaaggcc 840 aaggaaggca ggagactcta ttccgatgta taagaagtat gccctcccac ccggaccggg 900 cctacaacag ctgctatagc gcaggtgtct tccatttaca ccaaggggat attctgagtg 960 tcataattcc ccgggcaagg gcgaaactta acctctctcc acatggaacc ttcctggggt 1020 ttgtgaaact gtgattgtgt tataaaaagt ggctcccagc ttggaagacc agggtgggta 1080 catactggag acagccaaga gctgagtata taaaggagag ggaatgtgca ggaacagagg 1140 catcttcctg ggtttggctc cccgttcctc acttttccct tttcattccc accccctaga 1200 ctttgatttt acggatatct tgcttctgtt ccccatggag ctccgaattc ttgcgtgtgt 1260 gtagatgagg ggcgggggac gggcgccagg cattgttcag acctggtcgg ggcccactgg 1320 aagcatccag aacagcacca ccatctta 1348 <210> SEQ ID NO 18 <211> LENGTH: 1090 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 18 taactctcct gaggggtgag ccaagccctg ccatgtagtg cacgcaggac atcaacaaac 60 acagataaca ggaaatgatc cattccctgt ggtcacttat tctaaaggcc ccaaccttca 120 aagttcaagt agtgatatgg atgactccac agaaagggag cagtcacgcc ttacttcttg 180 ccttaagaaa agagaagaaa tgaaactgaa ggagtgtgtt tccatcctcc cacggaagga 240 aagcccctct gtccgatcct ccaaagacgg aaagctgctg gctgcaacct tgctgctggc 300 actgctgtct tgctgcctca cggtggtgtc tttctaccag gtggccgccc tgcaagggga 360 cctggccagc ctccgggcag agctgcaggg ccaccacgcg gagaagctgc cagcaggagc 420 aggagccccc aaggccggcc tggaggaagc tccagctgtc accgcgggac tgaaaatctt 480 tgaaccacca gctccaggag aaggcaactc cagtcagaac agcagaaata agcgtgccgt 540 tcagggtcca gaagaaacag tcactcaaga ctgcttgcaa ctgattgcag acagtgaaac 600 accaactata caaaaaggat cttacacatt tgttccatgg cttctcagct ttaaaagggg 660 aagtgcccta gaagaaaaag agaataaaat attggtcaaa gaaactggtt acttttttat 720 atatggtcag gttttatata ctgataagac ctacgccatg ggacatctaa ttcagaggaa 780 gaaggtccat gtctttgggg atgaattgag tctggtgact ttgtttcgat gtattcaaaa 840 tatgcctgaa acactaccca ataattcctg ctattcagct ggcattgcaa aactggaaga 900 aggagatgaa ctccaacttg caataccaag agaaaatgca caaatatcac tggatggaga 960 tgtcacattt tttggtgcat tgaaactgct gtgacctact tacaccatgt ctgtagctat 1020 tttcctccct ttctctgtac ctctaagaag aaagaatcta actgaaaata ccaaaaaaaa 1080 aaaaaaaaaa 1090 <210> SEQ ID NO 19 <211> LENGTH: 316 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 19 Met Arg Arg Ala Ser Arg Asp Tyr Gly Lys Tyr Leu Arg Ser Ser Glu 1 5 10 15 Glu Met Gly Ser Gly Pro Gly Val Pro His Glu Gly Pro Leu His Pro 20 25 30 Ala Pro Ser Ala Pro Ala Pro Ala Pro Pro Pro Ala Ala Ser Arg Ser 35 40 45 Met Phe Leu Ala Leu Leu Gly Leu Gly Leu Gly Gln Val Val Cys Ser 50 55 60 Ile Ala Leu Phe Leu Tyr Phe Arg Ala Gln Met Asp Pro Asn Arg Ile 65 70 75 80 Ser Glu Asp Ser Thr His Cys Phe Tyr Arg Ile Leu Arg Leu His Glu 85 90 95 Asn Ala Gly Leu Gln Asp Ser Thr Leu Glu Ser Glu Asp Thr Leu Pro 100 105 110 Asp Ser Cys Arg Arg Met Lys Gln Ala Phe Gln Gly Ala Val Gln Lys 115 120 125 Glu Leu Gln His Ile Val Gly Pro Gln Arg Phe Ser Gly Ala Pro Ala 130 135 140 Met Met Glu Gly Ser Trp Leu Asp Val Ala Gln Arg Gly Lys Pro Glu 145 150 155 160 Ala Gln Pro Phe Ala His Leu Thr Ile Asn Ala Ala Ser Ile Pro Ser 165 170 175 Gly Ser His Lys Val Thr Leu Ser Ser Trp Tyr His Asp Arg Gly Trp 180 185 190 Ala Lys Ile Ser Asn Met Thr Leu Ser Asn Gly Lys Leu Arg Val Asn 195 200 205 Gln Asp Gly Phe Tyr Tyr Leu Tyr Ala Asn Ile Cys Phe Arg His His 210 215 220 Glu Thr Ser Gly Ser Val Pro Thr Asp Tyr Leu Gln Leu Met Val Tyr 225 230 235 240 Val Val Lys Thr Ser Ile Lys Ile Pro Ser Ser His Asn Leu Met Lys 245 250 255 Gly Gly Ser Thr Lys Asn Trp Ser Gly Asn Ser Glu Phe His Phe Tyr 260 265 270 Ser Ile Asn Val Gly Gly Phe Phe Lys Leu Arg Ala Gly Glu Glu Ile 275 280 285 Ser Ile Gln Val Ser Asn Pro Ser Leu Leu Asp Pro Asp Gln Asp Ala 290 295 300 Thr Tyr Phe Gly Ala Phe Lys Val Gln Asp Ile Asp 305 310 315 <210> SEQ ID NO 20 <211> LENGTH: 24 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 20 Asn Ala Ala Ser Ile Pro Ser Gly Ser His Lys Val Thr Leu Ser Ser 1 5 10 15 Trp Tyr His Asp Arg Gly Trp Ala 20 <210> SEQ ID NO 21 <211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 21 His Glu Thr Ser Gly Ser Val Pro Thr Asp 1 5 10 <210> SEQ ID NO 22 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 22 Ser Ile Lys Ile Pro Ser Ser 1 5 <210> SEQ ID NO 23 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 23 Lys Asn Trp Ser Gly Asn Ser Glu Phe 1 5 <210> SEQ ID NO 24 <211> LENGTH: 34 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 24 Thr Arg Gly Arg Ser Asn Thr Leu Ser Ser Pro Asn Ser Lys Asn Glu 1 5 10 15 Lys Ala Leu Gly Arg Lys Ile Asn Ser Trp Glu Ser Ser Arg Ser Gly 20 25 30 His Ser <210> SEQ ID NO 25 <211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 25 Gln Glu Glu Ile Lys Glu Asn Thr Lys Asn 1 5 10 <210> SEQ ID NO 26 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 26 Thr Ser Tyr Pro Asp Pro 1 5 <210> SEQ ID NO 27 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 27 Ser Cys Trp Ser Lys Asp Ala Glu Tyr 1 5 <210> SEQ ID NO 28 <211> LENGTH: 18 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 28 Glu Ala Ser Ser Lys Thr Thr Ser Val Leu Gln Trp Ala Glu Lys Gly 1 5 10 15 Tyr Tyr <210> SEQ ID NO 29 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 29 Asn Arg Glu Ala Ser Ser 1 5 <210> SEQ ID NO 30 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 30 Ser Pro Gly Arg Phe Glu 1 5 <210> SEQ ID NO 31 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 31 His Ser Ser Ala Lys Pro Cys 1 5 <210> SEQ ID NO 32 <211> LENGTH: 17 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 32 Asn His Gln Val Glu Glu Gln Leu Glu Trp Leu Ser Gln Arg Ala Asn 1 5 10 15 Ala <210> SEQ ID NO 33 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 33 Gln Gly Cys Pro Asp 1 5 <210> SEQ ID NO 34 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 34 Ala Ile Ser Tyr Gln Glu Lys 1 5 <210> SEQ ID NO 35 <211> LENGTH: 14 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 35 Pro Cys Pro Lys Asp Thr Pro Glu Gly Ala Glu Leu Lys Pro 1 5 10 <210> SEQ ID NO 36 <211> LENGTH: 17 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 36 Asp Pro Ser Lys Gln Asn Ser Leu Leu Trp Arg Ala Asn Thr Asp Arg 1 5 10 15 Ala <210> SEQ ID NO 37 <211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 37 Lys Ala Tyr Ser Pro Lys Ala Thr Ser Ser 1 5 10 <210> SEQ ID NO 38 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 38 Ser Ser Gln Tyr Pro Phe His 1 5 <210> SEQ ID NO 39 <211> LENGTH: 8 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 39 Val Tyr Pro Gly Leu Gln Glu Pro 1 5 <210> SEQ ID NO 40 <211> LENGTH: 17 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 40 Glu Thr Arg Val Thr Val Pro Asn Val Pro Ile Arg Phe Thr Lys Ile 1 5 10 15 Phe <210> SEQ ID NO 41 <211> LENGTH: 1 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 41 Asp 1 <210> SEQ ID NO 42 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 42 Tyr Gln Glu Lys 1 <210> SEQ ID NO 43 <211> LENGTH: 24 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 43 Asn Ala Thr Asp Ile Pro Ser Gly Ser His Lys Val Ser Leu Ser Ser 1 5 10 15 Trp Tyr His Asp Arg Gly Trp Ala 20 <210> SEQ ID NO 44 <211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 44 His Glu Thr Ser Gly Asp Leu Ala Thr Glu 1 5 10 <210> SEQ ID NO 45 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 45 Ser Ile Lys Ile Pro Ser Ser 1 5 <210> SEQ ID NO 46 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 46 Lys Tyr Trp Ser Gly Asn Ser Glu Phe 1 5 <210> SEQ ID NO 47 <211> LENGTH: 17 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 47 Asn Pro Gln Ala Glu Gly Gln Leu Gln Trp Leu Asn Arg Arg Ala Asn 1 5 10 15 Ala <210> SEQ ID NO 48 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 48 Gln Gly Cys Pro Ser 1 5 <210> SEQ ID NO 49 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 49 Ala Val Ser Tyr Gln Thr Lys 1 5 <210> SEQ ID NO 50 <211> LENGTH: 14 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 50 Pro Cys Gln Arg Glu Thr Pro Glu Gly Ala Glu Ala Lys Pro 1 5 10 <210> SEQ ID NO 51 <211> LENGTH: 2201 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 51 ggccaaagcc gggctccaag tcggcgcccc acgtcgaggc tccgccgcag cctccggagt 60 tggccgcaga caagaagggg agggagcggg agagggagga gagctccgaa gcgagagggc 120 cgagcgccat gcgccgcgcc agcagagact acaccaagta cctgcgtggc tcggaggaga 180 tgggcggcgg ccccggagcc ccgcacgagg gccccctgca cgccccgccg ccgcctgcgc 240 cgcaccagcc ccccgccgcc tcccgctcca tgttcgtggc cctcctgggg ctggggctgg 300 gccaggttgt ctgcagcgtc gccctgttct tctatttcag agcgcagatg gatcctaata 360 gaatatcaga agatggcact cactgcattt atagaatttt gagactccat gaaaatgcag 420 attttcaaga cacaactctg gagagtcaag atacaaaatt aatacctgat tcatgtagga 480 gaattaaaca ggcctttcaa ggagctgtgc aaaaggaatt acaacatatc gttggatcac 540 agcacatcag agcagagaaa gcgatggtgg atggctcatg gttagatctg gccaagagga 600 gcaagcttga agctcagcct tttgctcatc tcactattaa tgccaccgac atcccatctg 660 gttcccataa agtgagtctg tcctcttggt accatgatcg gggttgggcc aagatctcca 720 acatgacttt tagcaatgga aaactaatag ttaatcagga tggcttttat tacctgtatg 780 ccaacatttg ctttcgacat catgaaactt caggagacct agctacagag tatcttcaac 840 taatggtgta cgtcactaaa accagcatca aaatcccaag ttctcatacc ctgatgaaag 900 gaggaagcac caagtattgg tcagggaatt ctgaattcca tttttattcc ataaacgttg 960 gtggattttt taagttacgg tctggagagg aaatcagcat cgaggtctcc aacccctcct 1020 tactggatcc ggatcaggat gcaacatact ttggggcttt taaagttcga gatatagatt 1080 gagccccagt ttttggagtg ttatgtattt cctggatgtt tggaaacatt ttttaaaaca 1140 agccaagaaa gatgtatata ggtgtgtgag actactaaga ggcatggccc caacggtaca 1200 cgactcagta tccatgctct tgaccttgta gagaacacgc gtatttacct gccagtggga 1260 gatgttagac tcatggtgtg ttacacaatg gtttttaaat tttgtaatga attcctagaa 1320 ttaaaccaga ttggagcaat tacgggttga ccttatgaga aactgcatgt gggctatggg 1380 aggggttggt ccctggtcat gtgccccttc gcagctgaag tggagagggt gtcatctagc 1440 gcaattgaag gatcatctga aggggcaaat tcttttgaat tgttacatca tgctggaacc 1500 tgcaaaaaat actttttcta atgaggagag aaaatatatg tatttttata taatatctaa 1560 agttatattt cagatgtaat gttttctttg caaagtattg taaattatat ttgtgctata 1620 gtatttgatt caaaatattt aaaaatgtct tgctgttgac atatttaatg ttttaaatgt 1680 acagacatat ttaactggtg cactttgtaa attccctggg gaaaacttgc agctaaggag 1740 gggaaaaaaa tgttgtttcc taatatcaaa tgcagtatat ttcttcgttc tttttaagtt 1800 aatagatttt ttcagacttg tcaagcctgt gcaaaaaaat taaaatggat gccttgaata 1860 ataagcagga tgttggccac caggtgcctt tcaaatttag aaactaattg actttagaaa 1920 gctgacattg ccaaaaagga tacataatgg gccactgaaa tttgtcaaga gtagttatat 1980 aattgttgaa caggtgtttt tccacaagtg ccgcaaattg tacctttttt tttttttcaa 2040 aatagaaaag ttattagtgg tttatcagca aaaaagtcca attttaattt agtaaatgtt 2100 attttatact gtacaataaa aacattgcct ttgaatgtta attttttggt acaaaaataa 2160 atttatatga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 2201 

What is claimed is:
 1. An isolated polypeptide comprising one or more amino acid sequences selected from SEQ ID NO 1-SEQ ID NO 5 and SEQ ID NO: 7-SEQ ID NO: 11, said polypeptide having the ability to bind RANK.
 2. A polypeptide of claim 1 having conservative or non-conservative modifications and still having the ability to bind to RANK.
 3. A polypeptide comprising a RANKL sequence involved in binding to RANK, wherein said sequence consists essentially of one or more amino acid sequences selected from SEQ ID NO 1-SEQ ID NO 5 and SEQ ID NO: 7-SEQ ID NO: 11, and said polypeptide has the ability to bind to RANK.
 4. The polypeptide of claim 3 having conservative or non-conservative modifications and having the ability to inhibit RANKL binding with RANK.
 5. A polypeptide, other than RANKL, comprising one or more external surface loops of RANKL, said polypeptide having the ability to bind to RANK.
 6. A polypeptide comprising one or more external surface loops of RANKL, said polypeptide having the ability to competitively inhibit RANKL binding with RANK.
 7. The polypeptide of claim 1 consisting essentially of the amino acid sequence selected from SEQ ID NO 1-SEQ ID NO 5 and SEQ ID NO: 7-SEQ ID NO:
 11. 8. A compound comprising a fragment, analog, mimic or derivative of the polypeptide of claim 1, 2, 3, 4, 5, 6, or 7, said compound having the ability to bind to RANK.
 9. A pharmaceutical composition comprising an effective amount of the polypeptide of claim 1, 2, 3, 4, 5, 6, 7, in a pharmaceutically acceptable carrier, adjuvant, solubilizer, stabilizer, and/or anti-oxidant.
 10. A pharmaceutical composition comprising an effective amount of the compound of claim 8 in a pharmaceutically acceptable carrier, adjuvant, solubilizer, stabilizer, and/or anti-oxidant.
 11. A method of inhibiting osteoclast differentiation comprising administering an effective amount of the pharmaceutical composition of claim
 9. 12. A method of inhibiting bone resorption comprising administering an effective amount of the pharmaceutical composition of claim
 9. 13. A method of competitively inhibiting RANKL, said method comprising administering an effective amount of the compound of claim 1-8.
 14. A method of competitively inhibiting RANKL, said method comprising administering an effective amount of the pharmaceutical composition of claim
 9. 15. A method of treating diseases or conditions selected from the group comprising osteoporosis, juvenile osteoporosis, osteogenesis imperfecta, hypercalcemia, hyperparathyroidism, osteomalacia, osteohalisteresis, osteolytic bone disease, osteonecrosis, Paget's disease of bone, bone loss due to rheumatoid arthritis, inflammatory arthritis, osteomyelitis, corticosteroid treatment, metastatic bone diseases, periodontal bone loss, bone loss due to cancer, age-related loss of bone mass, and other forms of osteopenia, wherein the method comprises administering an effective amount of the pharmaceutical composition of claim
 9. 16. The method of claim 15, wherein the diseases or conditions comprise osteoporosis, osteolytic bone disease, rheumatoid arthritis, and skeletal metastasis.
 17. A polypeptide comprising (a) a RANKL sequence involved in binding to RANK, wherein said sequence consists essentially of at least one amino acid sequence selected from SEQ ID NO 1-SEQ ID NO 5 and SEQ ID NO: 7-SEQ ID NO: 11, and said polypeptide has the ability to bind to RANK; and (b) additional amino acid residues flanking said sequence that do not eliminate the ability of the sequence to bind RANK.
 18. The polypeptide of claim 1 having one or more of the following modifications: (i) one in which one or more amino acid residues of the polypeptide are substituted with a conserved or non-conserved amino acid residue, such residue may or may not be one encoded by the genetic code; (ii) one in which one or more of the amino acid residues includes a substituent group; (iii) one in which the polypeptide is fused to another compound such as a compound to increase the half-life of the protein; (iv) one in which additional amino acids are fused to the polypeptide to aid in purification or in detection and identification; or (v) one in which additional amino acid residues are fused to the polypeptide to aid in modifying tissue distribution or localization of the protein to certain locations such as the cell membrane or extracellular compartments.
 19. A method of screening for RANKL-binding compounds, said method comprising performing a binding assay between a test compound, a reference compound and RANK wherein said reference compound consists essentially of a polypeptide of at least one amino acid sequence selected from SEQ ID NO 1-SEQ ID NO 5 and SEQ ID NO: 7-SEQ ID NO:
 11. 20. A polynucleotide encoding the polypeptide of claim
 1. 21. A polynucleotide of claim 20 which exhibit regular degeneracy in accordance with the degeneracy of the genetic code.
 22. A polynucleotide of claim 20 encoding a polypeptide exhibiting conservative substitutions to the polypeptide of claim
 1. 23. An expression vector comprising the polynucleotide of claim 20, 21, or
 22. 24. A host cell containing an expression vector of claim
 23. 25. The method of treating diseases of claim 15 further comprising administering an effective amount of one or more pharmaceutical compositions for enhancing bone growth to a subject in need thereof. 